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ABSTRACT 

An  algorithm  for  sizing  transistors  for  static  Complementary-symmetry  Metal- 
Oxide- Semiconductor  (CMOS)  integrated  circuit  logic  design  using  silicon  gate 
enhancement  mode  Field-Effect  Transistors  (FET)  is  derived  and  implemented 
in  software.  The  algorithm  is  applied  to  the  mask  level  hardware  design  of  a 
three  micron  minimum  feature  size  p  well  high-speed  super  buffer.  A  software 
representation  of  the  super  buffer  can  be  used  for  the  automated  design  of  custom 
Very-Large-Scale-Integrated  (VLSI)  circuits. 
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I. 


The  MacPITTS  silicon  compiler  is  a  tool  for  the  automated  design  of  complete, 
Very-Large-Scale  Integrated  (VLSI)  microchips  that  was  created  at  the  Massachu¬ 
setts  Institute  of  Technology  Lincoln  Laboratory  in  the  early  1980’s  [Massachusetts 
Institute  of  Technology  1982  Conference  on  Advanced  Research  in  VLSI,  1982,  pp. 
28-29].  To  use  MacPITTS  a  design  engineer  specifies  the  desired  logic  behavior 
of  a  complete  microchip  in  a  source  program  written  in  a  high  level  algorithmic 
language.  Writing  this  program  is  the  only  task  that  the  design  engineer  need 
perform — the  entire  design  process  is  then  performed  by  the  compiler.  The  com¬ 
piler  extracts  information  from  this  program  which  it  uses  to  create  a  mask  level 
layout  in  Caltech  Intermediate  Form  (CIF).  The  layout  when  specified  in  this  form 
is  suitable  for  use  by  silicon  foundries  for  the  fabrication  of  an  actual  microchip. 

Use  of  a  silicon  compiler  provides  dramatic  design  time  savings  over  handcraft¬ 
ing  layouts.  The  design — simulate — redesign  cycle  that  occurs  before  fabrication  of 
handcrafted  layouts  is  lengthy  and  therefore  expensive.  Handcrafted  designs  that 
take  many  months  of  man-hours  to  complete  using  computer  based  engineering 
design  tools  may  be  completed  in  a  much  shorter  time  by  using  a  silicon  com¬ 
piler.  Most  VLSI  designs  today  contain  20,000  to  50,000  devices.  When  human 
engineers  design  these  large  chips  they  partition  and  design  the  project  in  teams. 
Integrating  the  subsystems  into  a  final  working  design  requires  close  coordination 
throughout  the  design  process  and  a  uniform  approach  on  the  part  of  all  the  design 
teams.  Using  a  silicon  compiler  to  design  automatically  institutes  uniformity  in 
the  design  process.  The  compiler  uses  embedded  techniques  and  rigorous  checking 


and  verification  of  design  practices  to  ensure  that  the  resulting  circuit  is  “correct 
by  construction.”  These  embedded  functions  often  incorporate  the  practices  of 
expert  integrated  circuit  designers.  A  good  compiler  can  verify  the  functionality 
and  timing  of  a  design  after  it  has  been  implemented.  Some  can  provide  real-time 
checking,  verification  and  implementation  strategies  that  flag  potential  errors  and 
hazards  and  prevent  them  from  being  entered  into  the  design.  All  of  these  features 
of  silicon  compilers  contribute  to  the  production  of  designs  that  are  as  correct  as 
possible  before  fabrication.  This  is  extremely  important  in  VLSI  design  because 
it  is  difficult  and  costly  to  isolate  design  errors  on  a  finished  microchip.  [Pollack, 
Erickson,  and  Mazor,  1986,  pp.  79-80] 

The  MacPITTS  Silicon  Compiler  is  actually  a  collection  of  programs  each  of 
which  performs  a  specific  function.  The  base  language  of  these  programs  is  Franz 
Lisp  with  extended  functions  for  geometric  cell  layout  provided  by  the  Massachu¬ 
setts  Institute  of  Technology  Lincoln  Laboratory  language  L5  and  by  MacPITTS 
itself.  One  way  in  which  the  programs  can  be  categorized  is  to  lump  together  all 
those  that  are  dependent  upon  a  particular  microchip  fabrication  technology  and 
those  that  are  not.  By  altering  only  the  technology  dependent  programs  to  reflect 
the  design  rules  of  a  desired  technology  the  rest  of  the  compiler  can  become  tech¬ 
nology  independent.  This  is  a  very  desirable  feature  in  the  fast  moving  electronics 
industry  because  a  favored  technology  today  may  be  outmoded  tomorrow.  The 
technology  dependent  programs  in  the  original  MacPITTS  compiler  were  based 
on  n  channel  Metal-Oxide-Semiconductor  (NMOS)  technology.  NMOS  has  since 
been  largely  replaced  by  Complementary-symmetry  Metal-Oxide-Semiconductor 
(CMOS)  technology  as  the  commercial  technology  of  choice  for  low  power  consump¬ 
tion  VLSI  applications.  This  transition  in  industry  has  occurred  largely  because 


CMOS  technology  has  matured  to  the  point  that  it  now  rivals  NMOS  in  switch¬ 
ing  speed  yet  consumes  only  a  fraction  of  the  power  required  by  NMOS  circuitry. 
It  is  therefore  a  logical  step  to  modify  the  technology  dependent  portions  of  the 
MacPITTS  Silicon  Compiler  to  support  CMOS  design  rules. 

One  of  the  major  technology  dependent  portions  of  MacPITTS  is  the  body  of 
code  within  it  that  defines  the  geometric  layout  of  logic  cells.  The  compiler  selects, 
places,  and  interconnects  these  cells  in  such  a  way  that  the  result  is  a  design  for  a 
physically  achievable  microchip  that  will  behave  as  specified  by  the  word  picture 
that  the  design  engineer  presents  to  the  compiler  in  his  or  her  source  program. 
These  cells  reside  in  what  is  essentially  a  library. 

An  ongoing  project  at  the  Naval  Postgraduate  School  in  Monterey,  California, 
has  been  the  addition  of  a  CMOS  option  to  the  technology  dependent  portions 
of  the  MacPITTS  Silicon  Compiler.  Since  the  work  has  progressed  to  the  point 
that  the  present  compiler  is  a  significant  departure  from  the  original  MacPITTS 
the  hybrid  NMOS/CMOS  compiler  is  now  referred  to  as  the  Monterey  Silicon 
Compiler. 

The  immediate  goal  of  the  research  documented  in  this  thesis  is  to  create 
and  insert  into  the  Monterey  Silicon  Compiler  code  a  mask  level  integrated  circuit 
design  of  a  high-speed  static  CMOS  super  buffer  to  replace  the  existing  NMOS 
version.  In  the  Monterey  Silicon  Compiler  architecture  the  super  buffer  is  that 
part  of  the  branching  fan-out  tree  that  feeds  clock  signals  from  off-cliip  to  various 
parts  of  a  VLSI  circuit.  Since  the  super  buffer  is  a  gate  in  the  path  of  incoming 
clock  signals  it  must  be  guaranteed  to  function  within  specified  timing  constraints 
under  given  loading  lest  the  operation  of  an  entire  chip  fail  due  to  the  inability 
of  the  clock  circuit  to  drive  the  on  chip  storage  registers  within  an  acceptable 
amount  of  time  delay.  From  the  designer’s  point  of  view  the  timing  characteristics 
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of  enhancement  Metal-Oxide-Semiconductor  Field-Effect  Transistors  (MOSFET), 
of  which  the  super  buffer  is  made,  are  controlled  chiefly  by  the  mask  geometry 
of  the  MOSFETs.  Therefore,  a  major  portion  of  this  research  is  devoted  to  an 
analysis  of  transistor  sizing  for  static  CMOS  logic  gates  constructed  from  silicon 
gate  enhancement  MOSFETs. 

Transistor  sizing  is  the  principal  means  by  which  an  integrated  circuit  design 
engineer  can  influence  the  timing  of  a  MOSFET.  In  a  silicon  compiler  the  design 
engineer  is  a  computer.  Typically,  human  engineers  achieve  desired  timing  through 
a  design — simulate — redesign  iterative  method  based  on  rules  of  thumb  that  are 
handed  down  by  successful  engineers  to  junior  engineers.  A  silicon  compiler  that 
strives  to  optimize  signal  input  to  signal  output  delay  time  on  a  microchip  cannot 
work  in  this  way — it  must  be  given  explicit  instructions  on  how  to  decide  what 
transistor  sizes  are  required  to  drive  a  given  load,  with  a  given  supply  voltage  and 
operating  temperature,  within  a  specified  delay  time.  Chapter  II,  which  deals  with 
the  transistor  sizing  problem,  is  an  original  contribution  to  the  field  of  time  delay 
optimization  in  silicon  compiler  generated  custom  CMOS  VLSI  circuits.  A  coher¬ 
ent  body  of  mathematical  equations  that  detail  how  to  size  transistors  to  achieve 
desired  timing  parameters  under  known  loading  conditions,  operating  temperature, 
and  supply  voltage  is  derived  in  Chapter  II.  The  equations  are  applicable  to  general 
static  CMOS  design. 

Although  the  Monterey  Silicon  Compiler  is  not  capable  of  designing  a  circuit 
that  is  constructed  specifically  to  achieve  a  designer  specified  time  delay  this  is 
a  topic  applicable  to  the  general  field  of  silicon  compilation.  The  equations  in 
Chapter  II  are  useful  to  the  Monterey  Silicon  Compiler  as  a  design  tool  for  the 
construction  of  the  individual  logic  cells  in  the  library  that  the  compiler  uses  to 
build  an  integrated  circuit  floorplan.  The  equations  developed  are  implemented  in 
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A.  CHARACTERISTICS  OF  STATIC  CMOS  CIRCUIT  DESIGN 

Static  CMOS  Logic  gates  are  made  from  two  distinct  blocks  of  devices  that 
are  connected  together.  Within  one  block  axe  series  combinations  of  p  MOSFETs 
or  parallel  combinations  of  p  MOSFETs  or  both.  The  MOSFETs  in  the  p  block 
are  connected  to  a  supply  voltage. 
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Figure  2.1  Schematic  of  a  Possible  p  Block  Configuration  in  a  Static 
CMOS  Logic  Gate 


Figure  2.1  shows  an  example  of  a  possible  p  block  configuration  [Weste  and 
Eshraghian,  1985,  p.  16].  Within  the  other  block  are  series  combinations  of  n 
MOSFETs  or  parallel  combinations  of  n  MOSFETs  or  both.  The  MOSFETs  in 


Figure  2.2  Schematic  of  a  Possible  n  Block  Configuration  in  a  Static 
CMOS  Logic  Gate 

the  n  block  are  connected  to  ground.  Figure  2.2  shows  am  example  of  a  possible  n 
block  configuration.  [Weste  and  Eshraghian,  1985,  p.  16] 

Any  load  capacitance  to  be  driven  by  a  static  CMOS  logic  gate  is  attached  to 
the  node  defined  by  the  interconnection  of  the  p  and  n  blocks.  Figure  2.3  shows 
the  interconnection  of  the  two  blocks  with  a  capacitive  load  attached. 

The  simplest  static  CMOS  logic  gate  is  an  inverter.  The  inverter  shown  in 
Figure  2.4  consists  of  a  single  p  MOSFET  and  a  single  n  MOSFET.  In  an  ideal 
sense,  each  p  MOSFET  in  a  static  CMOS  logic  gate  may  be  thought  of  as  a  switch 
along  a  path  connecting  the  supply  voltage  to  the  load  capacitance  [Weste  and 
Eshraghian,  1985,  p.  7].  Similarly,  each  n  MOSFET  may  be  thought  of  as  a  switch 
along  a  path  connecting  the  load  capacitance  to  ground. 

Figure  2.5  illustrates  that  a  MOSFET  that  is  conducting  can  be  thought  of 
as  a  closed  switch  and  that  a  MOSFET  that  is  in  cutoff  can  be  thought  of  as  an 
open  switch.  Therefore,  the  voltage  across  the  terminals  of  the  load  capacitance 
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Figure  2.3  Interconnection  of  p  and  n  Blocks  in  Static  CMOS  Logic 
Gates  with  Capacitive  Loading 
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Figure  2.4  Schematic  of  a  Static  CMOS  Inverter  without  Capacitive 
Loading 


will  be  driven  to  the  logic  high  level  if  enough  switches  in  the  p  block  are  closed  to 
create  at  least  one  complete  path  from  the  supply  voltage  to  the  load  capacitance 
and  enough  switches  in  the  n  block  are  opened  to  sever  ail  paths  between  the  load 
capacitance  and  ground.  Figure  2. 6. a  illustrates  this  switch  level  behavior  for  the 
inverter  in  Figure  2.4.  The  voltage  across  the  terminals  of  the  load  capacitance 
will  be  driven  to  the  logic  low  level  if  enough  switches  in  the  n  block  are  closed  to 
create  at  least  one  path  from  the  load  capacitance  to  ground  and  enough  switches 
in  the  p  block  are  opened  to  sever  all  paths  between  the  supply  voltage  and  the 
load.  Figure  2.6.b  illustrates  this  switch  level  behavior  for  the  inverter  in  Figure 


B.  DERIVATION  OF  EQUATIONS  FOR  TRANSISTOR  SIZING  IN 
A  STATIC  CMOS  INVERTER 

1.  Overview 

In  the  physical  world  MOSFETs  are  not  ideal  switches.  In  addition  to  the 
capacitive  load  that  they  must  drive  they  have  internal  parasitic  capacitances  and 
resistances  that  must  also  be  driven.  These  parasitic  capacitances  and  resistances 
are  chiefly  determined  by  operating  temperature,  various  parameters  associated 
with  the  fabrication  process,  and  the  physical  dimensions  of  the  MOSFETs.  Only 
the  third  factor,  the  physical  dimensions  of  the  MOSFETs,  is  under  the  control  of 
a  circuit  designer. 

It  is  essential  that  the  circuit  designer,  be  it  a  human  or  an  automated  CAD 
tool  like  a  silicon  compiler,  have  a  method  to  determine  the  physical  dimensions 
of  the  MOSFETs  in  a  static  CMOS  logic  gate  that  are  required  to  drive  the  load. 
The  method  presented  here  finds  the  dimensions  of  the  p  and  n  MOSFETs  in 
the  inverter  of  Figure  2.4  that  are  required  to  drive  the  parasitics  and  known 
capacitive  load  within  a  designer  specified  time  constraint.  The  inverter  is  used 
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Figure  2.5  Switch  Level  Idealization  of  MOSFET  Behavior 


for  the  development  of  this  method  because  it  is  the  simplest  static  CMOS  logic 
gate  and  as  such  the  method  developed  for  the  inverter  can  be  easily  extended  to 
general  static  CMOS  logic  design,  the  topic  of  Section  C. 

2.  Rise  Time  and  Fall  Time  Determination 

Table  2.1  gives  the  ideal  (first  order)  equations  of  the  Shichman-Hodges 
model  describing  the  behavior  of  MOSFETs  in  their  three  regions  of  operation: 
cutoff  region,  linear  region,  and  saturation  region  [Hodges  and  Jackson,  19S3.  p. 
51,  and  MicroSim  Corp.,  1987,  p.  78].  Table  2.1  refers  to  the  variable  I\P{T}. 
KP  is  one  of  the  58  variables  used  in  the  SPICE  circuit  simulation  program,  a 
standard  CAD  tool  for  the  design  and  simulation  of  MOSFET  circuits.  Table 
2.2  presents  only  those  MOSFET  model  parameters  used  in  this  thesis  as  defined 
in  MicroSim  Corporation’s  PSpice  program  documentation  [MicroSim  Corp.,  1987, 
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Figure  2.6  Switch  Level  Idealization  of  Static  CMOS  Inverter  Behavior 

pp.  74-76].  The  model  parameters  cam  be  grouped  into  two  broad  categories:  those 
that  represent  the  mask  geometry  of  the  on-chip  devices  and  those  that  represent 
the  electrical  and  metallurgical  properties  of  the  chip  due  to  the  particular  type  of 
fabrication  process  used  to  construct  the  microchip.1  As  stated  in  Section  B.l, 

1  A  good  discussion  of  the  physical  meaning  of  the  SPICE  MOSFET  model 
parameters  can  be  found  in,  “The  Simulation  of  MOS  Integrated  Circuits  Using 
SPICE2,  Memorandum  No.  M80/7,  February,  1980”,  by  Vladimirescu  and  Liu. 
The  paper  is  available  through  the  Electronics  Research  Laboratory  Industrial 
Support  Office  at  the  University  of  California  at  Berkeley. 


V 

1*! 

[ 

I 

% 

g 

s 


t; 

•«! 

f 

v 

S 


!  «■ 


mmmm 


the  circuit  designer  is  able  to  modify  parameters  in  the  first  category  but  not  those 
in  the  second. 

Some  of  these  parameters  axe  temperature  dependent.  The  effects  of  tem¬ 
perature  on  those  SPICE  MOSFET  model  parameters  used  in  this  thesis  are  de¬ 
fined  by  the  equations  in  Table  2.3.  These  equations  axe  drawn  from  the  program 
documentation  for  PSpice  by  MicroSim  Corporation.  [MicroSim  Corp.,  1987,  p. 
80] 

Note  that  positive  drain  current  in  an  n  channel  enhancement  MOSFET 
flows  from  drain  to  source.  This  is  illustrated  in  Figure  2.7.a.  In  a  p  channel 
enhancement  MOSFET  it  flows  from  source  to  drain  [Sedra  and  Smith,  19S2,  p. 
311],  This  is  illustrated  in  Figure  2.7.b. 

Section  B.l  states  that  the  load  on  a  MOSFET  is  capacitive.  Switching 
from  one  logic  level  to  another  therefore  involves  the  charging  or  discharging  of 
the  paxasitic  and  load  capacitors  and  this  requires  time.  The  time  that  it  takes  a 
voltage  waveform  to  rise  from  10%  to  90%  of  its  steady  state  value  is  defined  to  be 
its  rise  time,  denoted  tr.  Fall  time,  denoted  tj ,  is  defined  to  be  the  time  it  takes 
a  voltage  waveform  to  fall  from  90%  to  10%  of  its  steady  state  value.  [Weste  and 
Eshraghian,  1985,  p.  137] 

The  following  discussion  is  based  on  a  method  of  finding  the  rise  and  fall 
time  of  a  static  CMOS  inverter  proposed  by  Weste  and  Eshraghian  [Weste  and 
Eshraghian,  1985,  pp.  137-140].  Given  the  circuit  of  Figure  2.8  in  which  C{  is 
initially  charged  so  that  V0  —  Vdd ,  a  pulse  of  magnitude  Vdd  is  applied  to  Vin. 
The  effect  of  the  pulse  on  the  circuit’s  behavior  is  plotted  in  Figure  2. 9. a.  In  this 
figure  the  regions  of  operation  of  the  n  MOSFET  and  p  MOSFET  are  labeled  in 
accordance  with  the  definitions  of  the  regions  presented  in  Table  2.1.  From  Figure 
2.9. a  it  is  evident  that  the  p  MOSFET  immediately  enters  the  cutoff  region  where 
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TABLE  2.1  MOSFET  DRAIN  CURRENT  EQUATIONS 
(FROM  MICROSIM  CORPORATION’S  PSPICE) 


DEVICE 

TYPE  REGION 


Vdsn  >  0 

Vgsn-Vt°n{T}<  0 
[cutoff  region]  _ 


Vdsn  >  0 

Vdsn  <  Vgs„  -  Vton{T) 
[linear  region] _ 


Vdsn  >  0 

0  <  Vgsn  -  Vton{T }  <  Vdsn 
[saturation  region] 


\Vdsp\>0 

\Vgsp\-\Vtop{T}\<0 
[cutoff  region] 


\Vdsp\  >  0 

\Vdsp\<\Vgsp\-\Vtop{T}\ 
[linear  region] 


\Vdsp\>0 

0<\Vgs,\-\Vtop{T}\<\Vdst 
[saturation  region] 


DRAIN  CURRENT 


=0 


VtoAT)  |) 


VtoJT}  =  YTOn  +  GAMMA „  ■  {( PHIJT }  +  \Vbsjy  -  (PHIJT)) 
\Yto„{T}\  =  \  VTOp\  +  GAM  M Ap  ■  ({PHIJT}  -u  1 7,s#J,/2  - 


*  L  =  L  -2-. 
" 

f  L  =  L  -  2  • 


<a>Poaitiua  Drain 
Currant  Flow 
i n  an  n  MOSFET. 


<b>Positiya  Drain 
Currant  Flout 
map  MOSFET . 


Figure  2.7  Current  Flow  in  MOSFETs 

/drainp  =  0.  The  n  MOSFET  passes  through  two  regions  as  VQ  fails:  the  saturation 
region,  then  the  linear  region.  It  is  possible  then  to  redraw  Figure  2.8  as  Figure 
2.9.b  during  fall  time  with  the  understanding  that  Jdrainn  is  dependent  upon  the 
region  of  operation  of  the  n  MOSFET  as  the  capacitor  voltage,  VQ,  falls.  Indeed, 
this  is  the  reason  for  the  use  of  a  dependent  current  source  symbol  in  Figure  2.9.b. 
From  Weste  and  Eshraghian  (Weste  and  Eshraghian,  1985,  p.  139): 

“. . .  it  is  evident  that  the  fall  time,  tj,  consists  of  two  intervals: 

1.  f  =  period  during  which  the  capacitor  voltage,  V0,  drops  from 
0.9  •  V  dd  to  (V  dd  —  Vton{T })  [/drainn  =  saturation  current] . 

2.  t ^  =  period  during  which  the  capacitor  voltage,  V0,  drops  from 
{V dd  —  Vton{T})  to  0.1  •  V dd  [/drainQ  =  linear  current].” 

Figure  2.9.b  can  now  be  used  to  write  an  equation  at  the  interconnection 
node  based  on  Kirchoff’s  current  law:  2 


^drainn  (  *c)  ^ 

2  Current  flow  convention:  Current  flowing  out  of  a  node  is  positive,  current 
flowing  into  a  node  is  negative. 
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TABLE  2.2  PSPICE  MOSFET  MODEL  PARAMETERS 
(FROM  MICROSIM  CORPORATION’S  PSPICE) 


L 

channel  length 

meter 

W 

channel  width 

meter 

AD 

drain  diffusion  area 

square  meter 

AS 

source  diffusion  area 

square  meter 

PD 

drain  diffusion  perimeter 

meter 

PS 

source  diffusion  perimeter 

meter 

LEVEL 

model  type  (1,  2,  or  3) 

LD 

lateral  diffusion 

meter 

VTO 

0-bias  threshold  voltage 

volt 

KP 

transconductance 

amp/volt2 

GAMMA 

bulk  threshold  parameter 

volt1/2 

PHI 

surface  potential 

volt 

LAMBDA 

channel-length  modulation 

volt-1 

RD 

drain  ohmic  resistance 

ohm 

RS 

source  ohmic  resistance 

ohm 

RSH 

drain,  source  diffusion  sheet  resistance 

ohm/sq. 

PB 

bulk  junction  potential 

volt 

CJ 

bulk  junction  0-bias  bottom  capacitance/area 

farad/meter2 

CJSW 

bulk  junction  0-bias  perimeter  capacitance/length 

farad/meter 

MJ 

bulk  junction  bottom  grading 

MJSW 

bulk  junction  sidewall  grading 

FC 

bulk  junction  fwd-bias  cap.  coef 

CGSO 

G-S  overlap  cap. /channel  width 

farad/meter 

CGDO 

G-D  overlap  cap. /channel  width 

farad/meter 

NSUB 

substrate  doping  density 

1/cm3 

NSS 

surface  state  density 

1/cm2 

NFS 

fast  surface  state  density 

1/cm2 

TOX 

oxide  thickness 

meter 

TPG 

gate  material:  +1  =  opposite  of  substrate 

—  1  =  same  as  substrate 

0  =  aluminum 

XJ 

metallurgical  junction  depth 

meter 

UO 

surface  mobility 

cm2 /volt-sec 

UCRIT 

mobility  degradation  critical  field 

volt /cm 

UEXP 

mobility  degradation  exponent 

VMAX 

max.  drift  velocity 

meter/ sec 

NEFF 

channel  charge  coef. 

DELTA 

width  effect  on  threshold 

TABLE  2.3  MOSFET  TEMPERATURE  EFFECTS 
(FROM  MICROSIM  CORPOPATION’S  PSPICE) 


EG{T}  =  1.16  -  (0.000702  •  T2)  /  (T  +  1108) 

PB{T}  =  PB  •  T/Tnom  -  3  •  Vt  ■  ln(T /Tnom) 

— EG{T}  4-  Eg{Tnom}  •  T/Tnom 

PSI{T}  =  PHI  •  T/Tnom  -3  -Vt-  ln(T /Tnom) 

—EG{T}  4-  Eg{Tnom}  ■  T/Tnom 

CJ{T }  =  CJ  •  (l  4-  MJ  ■  (.0004  •  (T  -  Tnom)  +  (1  -  PB{T)/PB))) 
c  t  iwitx  —  r  t  cw 

1 )  (  0004.  (T  _  Tnom) + (1  _  PB{r}/Pa))) 

K\P{X}  =  KP  ■  (T/Tnom)-'/'1 

CTO{T}  =  UO  ■  (T/Tnom)-'/1 

OT  =  (*P{r}  ■  100  •  TOX)/(£rO{T}  ■  e„) 


n 

I 

I 

fi**' 


Note: 


All  temperatures  in  degrees  Kelvin 
Tnom  =  nominal  temperature  (assumed  to  be  300.15  K) 
eQ  =  Electric  Field  Constant  (permittivity  of  vacuum)  [^] 
Vt  —  K  -  T / q  (thermal  voltage) 

K  =  Boltzmann’s  constant 
q  =  electron  charge 
T  =  operating  temperature 
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Figure  2.8  Schematic  of  a  Static  CMOS  Inverter  with  Capacitive  Load¬ 
ing 

dV  dV 

=0  ;  ^c  =  CL■-?^  (2.1) 


Applying  equation  (2.1)  to  the  first  interval  where  fdrainn  =  saturation 


current  gives: 


(£MD.y{Vllsn-VtoJT})’  +  CL 


After  substituting  Vdsn  =  V0  and  Vgsn  =  Vin  =  Vdd  and  some  algebraic 
manipulation  this  equation  becomes: 


dt  =  - 


(z5^)  '  (KPJT))  ■  (Vdd-VtoJT)) 


7  • 


Integrating  the  left  side  of  equation  (2.2)  from  t  =  1 1  t o  t  =  t2  and  the  right 
side  over  the  corresponding  output  voltages  VQ  =  0.9 Vdd  to  VQ  =  Vdd  —  Vton{T } 


gives: 


th  ~ 


2CL-  {VtOgiT}  -0-1  •  Vdd) 
(t*z)  ■  (KP»m)  ■  (VM-VtoJT)y 
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<a>  Plot  of  Circuit  Uoltagaa  Against 
Tima  For  Figura  2.8. 


<b>  Equiualent  Circuit. 


Figure  2.9  Effects  of  the  Rising  Edge  of  a  Pulsed  Input  on  a  Static 
CMOS  Inverter 


Equation  (2.1)  may  be  applied  to  the  second  interval  where  IAr,  ■  =  linear 

uni  n  u 


current: 


^)-VdSni2(V9Sn-VtojT})-Vdsn)+CL-^=0  (2.4) 


Again  substituting  Vdsn  =  VQ  and  Vgsn  =  Vin  =  Vdd  and  performing 
some  algebra  gives: 


- -  --  _ LJ _ 


Integrating  the  left  side  of  equation  (2.5)  from  t  =  t2  to  t  =  <3  and  the  right 
side  over  the  corresponding  output  voltages  V0  =  Vdd  —  Vton{T}  to  V0  =  0.1  Vdd 


(  w, 


(d«r) '  (AP»(TD  ■  (vdd-Vt°*{T}) 


(- 


Vdd- 20 -Vton{T} 
Vdd 


The  total  fall  time,  tj,  is  found  by  summing  equations  (2.3)  and  (2.0): 


if  _  th  +  th  _ 


u 

{i^;)-(KPnm)-(vdd-vtojT }) 

(rto„{r}-o.ir<M)  ,  l 

(Vdd  —  Vton{T})  +2  °S'(  Vdd  )  U‘ 


A  similar  procedure  may  be  applied  to  find  the  rise  time,  tr.  Given  the 
circuit  of  Figure  2.8  in  which  C L  is  initially  discharged  so  that  VQ  =  0,  the  input 
voltage  instantaneously  falls  from  V-n  =  Vdd  to  V-n  =  0.  The  effect  of  the  input 
voltage  on  the  circuit’s  behavior  is  plotted  in  Figure  2. 10. a.  In  this  figure  the 
regions  of  operation  of  the  n  and  p  MOSFETs  are  labeled  in  accordance  with  the 
definitions  of  the  regions  presented  in  Table  2.1.  From  the  figure  it  is  evident  that 
the  n  MOSFET  immediately  enters  the  cutoff  region  where  /drainn  =  0.  The  p 
MOSFET  passes  through  two  regions,  as  VQ  rises:  the  saturation  region  then  the 
linear  region.  It  is  possible  then  to  redraw  Figure  2.8  as  Figure  2.10.b  during  rise 
time  with  the  understanding  that  /drainp  is  dependent  upon  the  region  of  operation 
of  the  p  MOSFET  as  the  capacitor  voltage,  V0,  rises.  Figure  2.10.b  can  be  used  to 
write  an  equation  at  the  interconnection  node  based  on  Kirchoff's  Current  law: 
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<a>  Plot  of  Circuit  Uoitagaa  Against 
Tima  for  Figure  2.8. 


<b>  Equiuaiant  Circuit. 


Figure  2.10  Effects  of  the  Falling  Edge  of  a  Pulsed  Input  on  a  Static 
CMOS  Inverter 

Since  the  p  MOSFET  passes  through  two  regions  of  operation  during  the 
rise  time,  equation  (2.8)  must  be  applied  separately  to  each  region  and  the  separate 
results  are  summed  together.  The  lower  voltage  limit  in  the  time  span  denoted  t 
is  0.1  V dd.  The  upper  limit  may  be  found  from  Figure  2. 10. a.  In  Figure  2. 10. a.  the 
p  MOSFET  crosses  over  from  the  saturation  region  to  the  linear  region  when: 


'%  lV.v.v.v  .va  ;v.  v.  v  .v.%  .vLv 


(1  >  fa*  ,  '  '  *»§ '»**■  A**'  .li  Vb1 ,!»’  Al'.h'.ti'Ji’Jl'.U’ 


\Vdam\-\Vgs,\-\VtoJT}\ 


\Vdsp\  =  Vdd-VQ 


\VgsJ  -  \Vtop{T}\  =  Vdd  - \Vtop{T}\ 


(2.10) 


(2.11) 


Substituting  equations  (2.10)  and  (2.11)  into  (2.9)  and  solving  for  VQ  gives: 

V0  =  \Vto  {T}\ 


(2.12) 


Thus  the  rise  time  intervals  <  and  t  are  defined: 

rj  r2 


1.  =  period  during  which  the  capacitor  voltage,  VQ ,  rises  from  0.1  Vdd  to 
\Vtop{T}\  [/drainp  =  saturation  current] . 

2.  tr2  =  period  during  which  the  capacitor  voltage,  V0,  rises  from  \Vtop{T}\ 
to  0.9 Vdd  [/drajr|p  =  linear  current]. 

The  drain  current  equations  are  substituted  into  equation  (2.8)  for  the  two 
regions  of  operation  of  the  p  MOSFET  in  identical  fashion  to  the  procedure  used 
in  determining  t y  for  the  n  MOSFET.  The  result  is: 


t  —  t  +  t  — 

r  rl  r2 


(\Vto  {T}\  -  0.1  ■  Vdd)  t  1 


(Vdd-\VtolT)\) 


+  o  ■ IoSe 


19  ■  Vdd  —  20  •  | Vfo  { T}  | 


(2.13) 


Figure  2.11  shows  an  idealized  MOSFET  where  L  and  IV*  are  the  length 
and  width  of  the  channel,  respectively.  In  the  fabrication  process  that  produces 
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this  ideal  MOSFET  doping  ions  diffuse  only  vertically  down  into  the  substrate 
through  designer  specified  windows  in  the  mask  that  define  the  source  and  drain 
regions.  Figure  2.12  shows  the  result  of  a  more  true  to  life  fabrication  process.  In 
this  process  the  doping  ions  diffuse  not  only  vertically  down  into  the  substrate  but 
laterally  under  the  edges  of  the  window  as  well.  The  effect  is  that  the  drain  and 
source  regions  are  bloated  in  all  directions  by  an  amount  LD.  Since  this  lateral 
diffusion  also  occurs  at  both  ends  of  the  channel,  the  channel  length  is  effectively 
reduced  by  2  •  LD.  The  effective  channel  length  is  denoted  Lejj  in  Table  2.1  and 
in  equations  (2.7)  and  (2.13).  For  short  channel  devices  this  decrease  in  channel 
length  can  have  a  significant  impact  on  drain  current. 

Rewriting  equations  (2.7)  and  (2.13)  in  terms  of  effective  channel  length 

gives: 

,  _ l_ £l _ 

'  (rr-&B.-) '  •  (va-vtojT)) 

’(Vton{T}~  0.1  Vdd)  1  / 19 -Vdd- 20  Vton{T} 

( Vdd~Vton{T })  +2‘°SeV  Vdd 


t  _  _ *  ’  _ 

{T^LD^(KPr{T))-(Vdd-\vtor{T)\) 

'(|y«°,|T)|-0.1-y<«)  ,  1  ,  /19 •  V<M  —  20jjK/op{r}|\  1 

(Vdd- \Vtop{T}\)  +  2  '  °S'  \  Vdd  )  ° 

where  Vt.on{T }  and  |V’top{T}|  are  defined  in  Table  2.1  and  A'Pn{T}  and  A'Pp{T} 
are  defined  in  Table  2.3. 


Delay  time,  denoted  td ,  is  the  time  it  takes  a  logic  transition  to  pass  from 
the  input  of  a  gate  to  its  output.  In  fully  restored  CMOS  logic  td  is  explicitly 
defined  to  be  the  amount  of  time  that  elapses  between  the  moment  that  the  input 
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signal  to  a  gate  transitions  through  Vdd/2  and  the  moment  that  the  output  affected  J 

by  that  input  transitions  through  Vdd/2  [Weste  and  Eshraghian,  1985,  p.  141]:  i 

i 


t 


da.vg 


K  +  tf 

4 


' 

(2.16)  \ 


One  of  the  more  attractive  feature  of  CMOS  circuits  is  that  symmetric  j 

output  sourcing  and  sinking  currents  are  easily  achieved  resulting  in  equal  rise  and  < 

« 

fall  times.  When  this  is  achieved  t,  becomes: 

«avg 


Javg 


(2.17)  ; 
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To  achieve  tr  =  t ^  for  the  inverter  in  Figure  2.8  equations  (2.14)  and  (2.15) 
are  set  equal  to  one  another  then  Wn  is  solved  for.  The  result  is: 


Wn  ~W-  A 

n  p 


(2.18) 


where 


(£„  -2  •  LDJ  •  (KPr{T))  ■  {Vdd-  \Vto,{T)\) 
(i,  -  2-  LDr)  ■  (KPJT))  ■  (Vdd- Vton{T}) 


(V to n  [T]  —  0.1  V dd)  I  ,  /  \9  V dd  —  20  V toa  {T}  \ 

(vdd-Vton[T))  +2  l0Sa  Vdd  ’ 


(lVtOp{T}|-0.lV^)  ,  /19- 

(u<id-|vt0p{r}|)  2  °eV 


Vdd-20  \Vtov{T)\  \ 
Vdd  ) 


(2.19) 


In  the  drain  current  equations  of  Table  2.1  it  is  evident  that  as  the  channel 
length,  T,  is  decreased  drain  current  increases.  To  maximize  speed,  L  should  be  the 
minimum  channel  length  permitted  by  the  design  rules  of  the  particular  technology 
being  used.  This  length  is  usually  the  same  as  the  minimum  feature  size  of  the 
technology. 
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Keeping  in  mind  the  previous  paragraph  on  the  selection  of  Ln  and  Lp, 
all  the  values  on  the  right  side  of  equation  (2.18)  are  now  known  except  for  Wp. 
Once  Wp  is  found,  a  Wn  that  results  in  tj  ~  tr  and  tdavg  —  can  be  readily 

obtained  from  equation  (2.18).  Finding  Wp  is  dealt  with  in  Section  4. 

3.  Capacitive  Loading 

The  load  capacitance,  CL ,  has  been  referred  to  in  Section  B.2,  but  not 
yet  described.  The  term  CL  represents  three  parallel  capacitances  [Weste  and 
Eshraghian,  1985,  p.  123]: 


CL  -  Cbdlotii  +  C  ^gate  4-  CT 


(2.20) 


io  ad 
de  vices 


where 


CHota,  —  the  diffusion  capacitance  of  the  drain 
regions  connected  to  the  output 
C  ^gate  =  gate  capacitance  of  the  load  devices  connected 


load 

device* 


to  the  output  of  the  gate  under  design 
Cr  =  routing  capacitance  of  the  interconnect 

between  the  p  block,  the  n  block,  and  the  load 
It  should  be  noted  that  in  the  definition  of  the  term  Cid(  ^  (  the  word 
region  is  plural.  That  is  to  say  that  in  Figure  2.8  each  transistor,  when  active, 
must  drive  not  only  its  own  drain  diffusion  to  substrate  capacitance  but  also  that 
of  the  other  transistor  that  is  connected  to  it.  This  is  so  because  the  drains  are 
connected  together  resulting  in  parallel  capacitances.  CWtota),  C  ^^gate.  and  Cr 

load 

device* 

are  described  in  the  next  three  subsections. 
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a.  Diffusion  to  Substrate  Capacitance  of  the  Interconnection 
Node 

Figure  2.13  shows  a  drain  area  diffused  into  the  substrate  and  bloated 
by  an  amount  LD  in  ail  horizontal  directions.  To  the  top  of  the  figure  would  be 
the  channel,  but  the  channel,  gate,  and  source  have  been  removed  to  make  clear 
that  only  the  drain  area  is  being  considered  here.  At  the  bottom  end  of  the  drain 
is  an  area  of  diffusion  that  protrudes.  This  area  has  been  reserved  for  a  metal  to 
diffusion  surface  contact.  Similar  contacts  may  be  placed  alongside  this  contact  at 
the  bottom  end  of  the  drain  or  on  the  sides  of  the  drain  or  both  (but  obviously  not 
at  the  channel  end).  The  important  drain  area  dimensions  shown  in  Figure  2.13 


Wd  eff=  Effective  diffusion  width  =  W  +  2  •  LD 

L d  e/f  =  Effective  diffusion  length  =  Ld  rnaak  +  2  •  LD 

where  Ld  maslt  is  the  mask  level  diffusion  length 
Wc  ejf  —  effective  contact  width  =  Wc  +  2  •  LD 

where  Wc  is  the  mask  level  contact  width 
Lc  =  mask  level  contact  length 

In  Figure  2.13  the  length  of  the  contact  area  at  the  bottom  edge  of  the 
diffusion  runs  from  top  to  bottom,  and  the  contact  width  runs  from  left  to  right. 
If  contacts  were  placed  along  the  sides  of  the  diffusion  area  shown  in  Figure  2.13 
their  lengths  would  run  from  left  to  right  in  the  figure  while  their  widths  would 
run  from  top  to  bottom.  These  dimensions  are  just  the  reverse  of  the  dimensions 
of  the  contact  along  the  bottom  edge  of  the  diffusion  area  in  Figure  2.13. 

From  Figure  2.13  the  drain  area  is  defined: 


AD  =  W4ieJ/  ■  Ldteff  +  N  ■  \VC'tff  •  Lc 


s 


d.  mask 


CROSS  SECTION,  PERSPECTIVE  VIEW 

Figure  2.14  Bulk-Drain  Depletion  Capacitance  and  Metal  to  Diffusion 
Contact  Capacitance 

Figure  2.14  shows  the  three  capacitances  of  concern  in  calculating  Cbd. 
CJ{T }  and  C J SW{T)  in  Figure  2.14  are  defined  in  Tables  2.2  and  2.3.  Cmd  is 
the  capacitance  that  results  by  bringing  metal  into  contact  with  the  protrusion 
areas — like  the  one  at  the  bottom  edge  of  the  diffusion  in  Figure  2.14 — that  are 
reserved  for  metal  to  diffusion  contacts. 
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Having  defined  the  dimensions  of  the  drain  diffusion  and  its  capacitive 
components,  Cbd  may  now  be  defined  [MicroSim  Corp.,  1987,  p.  79]: 

Cbd{T,  Vbd}  =  bulk-drain  depletion  capacitance 

+  metal  to  diffusion  contact  capacitance 
=  bottom  junction  capacitance  -f  sidewall  capacitance 
+  metal  to  diffusion  contact  capacitance 
=  AD  CJ{T}  ■  Cbdj  {T,  Vbd} 

+PD  ■  CJSW{T }  •  Cbds{T ,  Vbd) 

+N  ■  Lc  ■Wc  ■  Cmd 

(2.23) 


Cbd  is  a  function  of  Vbd  because  Cbdj  and  Cbds  in  equation  (2.23) 
axe  functions  of  Vbd.  This  implies  that  all  three  will  vary  during  switching  due  to 
variations  in  bias  level.  To  reduce  the  difficulty  that  this  introduces  in  the  analysis, 
values  that  are  dependent  on  temperature  only  may  be  assigned  to  Cbdj  and  Cbds 
if  the  values  assigned  are  chosen  carefully  and  with  good  judgment.  [McCarthy, 
1982,  p.58] 

MicroSim’s  PSpice  defines  Cbdj  {T,  Vbd}  and  Cbds{T,  Vbd}  [MicroSim 
Corp.,  1987,  p.  80]: 

Cbdj{T,  Vbd}  =  (1  -  FC)-{1  +  MJ) 

■  ^1  —  FC  ■  (1  -f  AIJ)  4-  M  J  ■  |  Vhd\J  PB{T}^  [dimensionless]  (2.24) 

Cbds{T,Vbd}  =  (1  -  FCy{'  +  x,JSW) 

■  (l  -  FC  ■  (1  +  M.JSW)  +  MJSW  ■  \Vbd\/PB{T })  [dimensionless] 


During  switching  0.1  •  Vdd  <  \Vbd\  <  0.9  -Vdd.  |  Vfe<i|  does  not  dwell  at 
either  end  of  this  range  any  longer  than  it  does  at  the  other.  Therefore,  Cbdj  and 
Cbds  may  be  expressed  as  function  of  T,  independent  of  Vbd  by  simple  averaging: 


CbdJd^{T} 


1 

2 


Cbdj  {T,  Vbd} 


+Cbdj{T,Vbd} 


Vbd=0.9  Vdd 


Vbd=0.1Vdd  J 

(2.26) 


Cbds 


da.vg 


Cbds{T ,  Vbd} 

+Cbdj{T,Vbd} 

. 

Vbd=0.9Vdd 

Vbd=0.l  Vdd  J 

(2.27) 


Substituting  equations  (2.26)  and  (2.27)  into  equation  (2.23)  allows 
Cbd  to  be  expressed  as  a  function  of  temperature  only: 

Cbd{T)  =AD  ■  CJ{T}  ■  Cbdjhvg{T}  +  PD  ■  CJSW{T }  -  Cbdsw%{T) 

+  N  ■  Lc-Wc-  Cmd  (2.2S) 


Substituting  equations  (2.21)  and  (2.22)  for  the  drain  area  and  drain 
perimeter  into  equation  (2.28)  and  grouping  terms  gives: 

Cbd{T)  =Wd,„  ■  (Ld,„  ■  CJ{T)  ■  Cbdj„f{T }  +  2  •  CJSW{T )  ■  Cbds„g{T)) 
+  2  •  +  n-lc )  •  cvsirir) .  Cbds„t{T} 

+  N-Lc.  ■  CJ{T)  ■  CMr,vg{Tl  +  Wc  ■  Cmd)  (2.20) 

Figure  2.15  is  a  literal  translation  from  the  schematic  in  Figure  2.S 
(with  no  load  attached)  to  a  mask  level  layout.  That  is  to  say  that  the  p  and  n 
MOSFETs  are  in  the  same  position  with  respect  to  one  another  in  Figure  2.15  as 
they  are  in  Figure  2.8.  Figure  2.15  shows  graphically  why  each  transistor  drives 


not  only  the  load  and  its  own  drain  diffusion  capacitance  but  the  drain  diffusion 
capacitance  of  the  other  transistor  as  well.  It  is  because  the  two  drain  areas  are 
hard-wired  together. 

Since  the  two  capacitances  Cbdp{T}  and  Cbdn{T}  are  in  parallel  with 
one  another  CWtotal{T}  becomes: 


CHotuCT}  =  CbdAT}  +  Cbdn{T} 


(2.30) 


Equation  (2.30)  can  now  be  fully  expanded  according  to  the  following  procedure: 


1.)  Expand  Wd  eff,  Ld  eff,  and  Wc  tfJ  in  equation  (2.29)  according  to  their 
definitions  at  the  beginning  of  Section  B.3.a. 


2.)  Take  the  expression  resulting  from  step  1  and  substitute  it  into  equa¬ 
tion  (2.30). 


3.)  Make  use  of  equation  (2.18)  to  express  n  channel  widths  in  terms  of  p 
channel  widths. 


+  2CJSWI{T}CM.„,J{T}) 

+  2  •  ([ X,„„k,  +  2  •  ID,]  +  lf,-L„)-  CSJW,{T }  •  Cbds {T} 

+  W,  '  L„  •  (K  +  2  •  ID,]  ■  CJ,{T }  ■  Cbdj,vlJT)  +  W„  ■  CmJz) 
for  x  =  p,  n 

(2.33) 

and  .4  in  equation  (2.32)  is  defined  by  equation  (2.19). 
b.  Gate  Capacitance  of  the  Next  Stage 

The  capacitance  of  the  input  gates  of  the  immediate  next  logic  stage 
is  determined  in  this  section.  Figure  2.16  shows  a  single  load  MOSFET  from  a 
top  view.  The  load  channel  dimensions  W)oad  and  Lch  load  eff  are  defined  in  Figure 
2.16  where: 

^ch, load, eff  ^ch , load  .mask  ^  ’  I'D 

Figure  2.17  shows  three  capacitances  formed  between  the  gate  material  above  the 
surface  plane  of  the  wafer  and  the  channel  and  diffusion  areas  below.  The  capaci¬ 
tance  Cg  is  the  capacitance  that  forms  between  the  gate  and  the  substrate.  This 
capacitance  is  much  larger  when  the  load  transistor  is  off  than  when  it  is  on.  In 
design  work  it  is  always  safest  to  estimate  capacitive  loading  conservatively  so  that 
a  realistic  safety  margin  is  allowed  for  variable  operating  conditions  of  a  physical 
device.  Since  the  load  capacitance  component  is  largest  when  the  load  transistor 
is  off  it  is  best  from  an  engineering  standpoint  to  assume  that  the  physical  device 
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Figure  2.16  Load  MOSFET  Channel  Dimensions 

will  be  operated  at  low  frequency  (below  100  Hz).3  For  low  frequency  operation 
Cg  may  be  estimated  from  [Weste  and  Eshraghian,  1985,  p.  125]: 


Cg  = 


C0  '  erel .oxide  {^} 

TOX 


)  •([£,*.... d.  —i,-  2  -LD\-W^)  (2.34) 


Cgs  and  Cgd  in  Figure  2.17  are  the  capacitances  between  the  gate  and 
the  source  diffusion  and  between  the  gate  and  drain  diffusion.  These  capacitances 
are  a  direct  result  of  the  overlapping  of  the  gate  and  drain  and  of  the  gate  and 
source  caused  by  the  lateral  diffusion  of  doping  ions  into  the  intended  channel  area 


3  Note  that  channel  length  becomes  shorter  when  the  load  device  oper¬ 
ates  in  the  saturation  region.  In  saturation,  Lch  load  e//  =  (Lch  load  mask  -2  ■  LD)  ■ 

(l  —  LAMBDA  •  |VjJ).  Since  the  load  device  operates  in  all  three  regions — cutoff, 
linear,  and  saturation — the  region  chosen  for  modeling  the  physical  load  channel 
length  is  that  which  yields  the  most  conservative  estimate  for  Cg ,  namely  the  linear 
region  where  LchXoiAtff  =  Lc/l,load,raask  -2- LD. 
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as  discussed  in  Section  B.2.  Cgrs  and  Cgd  are  determined  from  [MicroSim  Corp., 
1987,  p.  80]: 

Cgs  =  CGSO  •  ^C4d 
C<7d  =  CGDO  •  ^oad 

k — ^hi  load,  mask — ^ 


RIGHT  SIDE  UIEW 


Figure  2.17  Load  MOSFET  Gate  Capacitance 


As  Cg,  Cgs ,  and  Cgd  aure  in  parallel  with  one  another,  C^gate  is 


equal  to  their  sum  over  all  load  devices: 


C  ^  gate  —  [Cg  4-  Cgs  4-  Cgd ] 


load 

devices 


(2.34) 


lo*d  load 

devices  devices 


-  E 


load 

devices 


f0  £rel, oxide  {^} 

TOX 


) •(* 


ch, load  .mask 


-2 -LD)-Wl0 


+  CGSO  •  Wload  +  CGDO  •  VVjoad 


(2.35) 


where  all  the  variables  in  equation  (2.35)  are  defined  in  Tables  2.2  and  2.3. 
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c.  Routing  Capacitance 

Routing  capacitance  between  layers  or  between  a  layer  and  substrate 
is  determined  by  multiplying  the  capacitance  per  unit  area  measured  between  the 
specific  layers  by  the  area  that  is  jointly  occupied  by  both  layers.  Values  for  these 
unit  area  capacitances  must  be  obtained  from  the  fabrication  facility  being  used. 
Typical  values  for  a  4  micron,  two  level  metal,  silicon  gate  CMOS  process  are 
provided  in  Weste  and  Eshraghian  [Weste  and  Eshraghian,  1985,  p.  135].  An 
alternate  method  is  to  remove  Cr  from  equation  (2.20)  and  compute  the  delay  due 
to  routing  capacitance  separately.  A  paper  by  John  L.  Wyatt,  Jr.  entitled,  “The 
Practical  Engineer’s  No-Nonsense  Guide  to  On-Chip  Signal  Delay  Calculation,” 
available  through  the  M.I.T.  VLSI  memo  series  gives  worst  case  bounds  on  delay 
time  due  to  routing  for  use  in  this  type  of  approach  [Wyatt,  1987].  Note  that  the 
capacitance  due  to  diffusion  to  metal  contacts  on  the  drains  wired  to  the  node  that 
connects  the  p  and  n  blocks  is  considered  separate  from  Cr,  and  is  developed  in 
Section  B.3.a. 

4.  Finding  Wp  and  Wn 

Figure  2.15,  which  was  discussed  in  Section  B.3.a,  shows  the  dimensions 
Wp  and  Wn.  As  discussed  in  Section  B.l,  the  circuit  designer  needs  to  have  a 
method  to  determine  the  widths  Wp  and  Wn  required  to  drive  the  parasitics  and 
known  capacitive  load  within  a  designer  specified  time  constraint  given  some  ex¬ 
pected  temperature  and  supply  voltage  operating  conditions.  All  that  has  been 
accomplished  thus  far  in  Chapter  II  is  to  gather  together  the  tools  necessary  to 
make  this  calculation.  In  this  section  the  tools  axe  used  together  to  find  W  and 
Wn  for  the  static  CMOS  inverter  of  Figures  2.8  and  2.15. 
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m  >1.' 


where: 


Rearranging  equation  (2.15)  to  solve  for  Wp  gives: 

V/=F  CL  [G  +  3] 


2-(Lp-2LDp) 

(tr)-(KPp{T))-(Vdd-\Vtop{T}\) 
(\Vtop{T}\- 0.1 -Vdd) 

(Vdd  —  \Vtop{T}\) 
tt  i  ,  ( 10  •  Vdd- 20 -\Vto  {T}\\ 

H  ~  2  °®e  \ - Vdd  - - j 


(2.36) 


(2.37) 


(2.3S) 


(2.39) 


But  C ,  is  itself  a  function  of  the  unknown  W„  since 

L  p 


CL  =  Cbdlotil  +  C  ^ gate  +  Cr  ;from  equation  (2.20)  (2.40) 


lo  %d 
devices 


CL  =  C  •  Wp  +  D  +  C  gate  +  Cr  ;  from  equation  (2.31)  (2.41) 


load 
de  vices 


Renaming  C  ^  gate  and  Cr  for  simplicity: 


losd 

devices 


B  =  C  Jugate 


(2.42) 


losd 

devices 


E  =  C, 


(2.43) 


Substituting  equations  (2.42)  and  (2.43)  into  equation  (2.41)  gives: 


CL  =  C  ■  Wp  +  D  +  B  +  E 


(2.44) 


Substituting  equation  (2.44)  into  equation  (2.36)  and  solving  for  IF  gives 


the  final  result  for  the  static  CMOS  inverter: 

~  _  F^(G  +  H)-(B  +  D+1) 
p  1  -F-C-(GVH) 


(2.45) 


Wn  =  W  •  A  (from  equation  (2.18))  (2.46) 
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The  equations  defining  the  variables  in  equations  (2.45)  and  (2.46)  are 
referenced  in  Table  2.4. 

Practical  calculation  of  Wp  requires  a  computer  program.  A  program  to 
calculate  Wp,  Wn,  and  all  other  MOSFET  parameters  required  on  a  MOSFET 
card  in  a  PSpice  deck  is  provided  in  Appendix  A.  This  program  is  written  for  an 
HP41CX  with  card  reader  and  two  extended  memory  modules. 

TABLE  2.4  IMPORTANT  EQUATIONS 
FOR  THE  STATIC  CMOS  INVERTER 


/, 

I 


Variable 

Defining  Equations(s) 

Wp 

(2.45) 

wn 

(2.46) 

A 

(2.19) 

B 

(2.42) 

and 

(2.35) 

C 

(2.32 

D 

2.33 

E 

2.43 

F 

2.37 

G 

2.38 

H 

2.39) 

5.  Drain  Resistance 


With  respect  to  the  drain  diffusion  region,  only  its  capacitive  value  has 
been  determined  thus  far.  Proper  modeling  of  a  MOSFET  in  PSpice  requires 
that  the  drain  resistance  be  provided  as  well.  Figure  2.18  shows  a  simple  drain 
diffusion  region  (substrate  omitted  from  the  figure)  and  its  resistive  equivalent.  RD. 
The  resistance  between  end  faces  of  the  figure  can  be  calculated  from  [McCarthy. 
1982,  p.  39]: 

__  _  drain  length  . 

=  P  ‘  end  face  area 


where  p  is  the  average  resistivity  for  the  diffusion  layer. 
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Figure  2.18  Drain  Diffusion  Region  and  Resistive  Equivalent 


Figure  2.19  shows  a  drain  diffusion  area  with  a  single  contact  area  defined 
and  a  resistive  equivalent.  Since  R  and  R  in  Figure  2.19  are  in  series: 


RD  =  R  +  R. 


(2.50) 


RD  =  p 


Wd,e/fXJj 


+  P‘ 


"cffXJ, 


RD  =  RSH  ■ 


(Ld, misk  +  2  '  ^D) 


(W  +  2-LD) 


+ 


Wc  +  2-LD 


(2.51) 


off  K- 


Figure  2.19  Drain  Diffusion  Region  with  a  Single  Metal  to  Diffusion 
Contact  Area  and  Resistive  Equivalent 


Figure  2.20  is  an  extension  of  the  resistance  calculations  performed  in 
equations  (2.49)  and  (2.51)  to  the  general  case  where  several  contact  areas  may  be 
placed  around  the  diffusion  area.  In  the  figure 


^"eq  -L-  4-  j-  -I _ L_ 

Ri  '  Ri  '  Rn 


But  assuming  that  all  the  contact  areas  have  identical  dimensions,  R1  =  J?.,  = 


=  R  =  R  ,  where  Rc  is  the  resistance  of  any  one  contact  area  and  so: 


(2.52) 


Using  the  results  of  equations  (2.52)  and  (2.51)  gives  the  drain  resistance  for  the 


general  case  depicted  in  Figure  2.20: 


RD  =  R  +  Req  =  RS H  ■ 


'd, mask  +2'  LD\  1  / 

W+2LD  )  N  '  V 


N  \Wc  +  2-LD< 


)]  (2.53) 


Given  that  the  contact  areas  employed  by  the  designer  are  of  the  minimum 
dimensions  permitted  by  the  design  rules  for  the  technology  being  used,  equations 
(2.23)  in  Section  B.3.a.  and  (2.53)  in  this  section  confirm  an  intuitive  trade  off: 

•  placing  more  contact  areas  decreases  drain  resistance  but  increases  drain 
capacitance. 

•  placing  fewer  contact  areas  decreases  drain  capacitance  but  increases  drain 
resistance. 

6.  Theoretical  Bounds  on  Inverter  Performance 

Equation  (2.45)  can  be  manipulated  algebraically  into  the  form: 


(V-0  -(G  +  H)-(D  +  D  +  E) 
l  v  =  -1 - L - 

P  t  _  2  (  Ly-2  LDy)  C-(G+H) 

r  (l\  Pp  {T})  (v  dd-\V  top  (T}|) 


(2.54) 
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Figure  2.20  Drain  Diffusion  Region  with  N  Metal  to  Diffusion  Contact 
Areas  and  Resistive  Equivalent 


The  denominator  of  equation  (2.54)  shows  that  the  p  transistor  width,  W  , 
goes  to  infinity  as  the  denominator  goes  to  zero  which  occurs  at  the  point  where: 


_  2-(Lp-2  LDp)-C-(G  +  H ) 
r  (KP  {T})  •  (Vdd  —  \Vto AT}\) 


(2.55) 


Obviously  the  p  MOSFET  required  to  source  enough  current  to  achieve  the 
rise  time  defined  by  equation  (2.55)  would  be  so  large  as  to  be  physically  impossible 
to  construct.  Therefore,  equation  (2.55)  imposes  a  theoretical  bound  on  the  rise 
time  achievable  by  a  static  CMOS  inverter  of  the  type  shown  in  Figures  2.8  and 


If  a  previously  constructed  inverter  is  being  examined  its  minimum  theo¬ 
retical  rise  time  can  be  determined  by  rearranging  equation  (2.45)  into  the  following 


form. 


t  2-(Lp-2-LDp)-(G  +  H)  I-  (B  +  D  +  E) 
r  (KPp{T})-(Vdd-\Vtop{T}\)'[  Wp  + 


(2.56) 


It  is  important  to  realize  that  the  inverter  being  examined  may  have  been 
constructed  such  that  the  variable  A  in  the  expression  Wn  —  Wp  ■  A  is  not  defined 
by  equation  (2.19).  In  such  a  case,  the  numerical  value  of  A  that  is  obtained  from 
the  physical  layout  of  the  inverter  should  be  used  in  the  calculation  of  the  variable 
C  in  equation  (2.56). 

For  this  same  inverter  that  has  already  been  constructed  the  amount  of 
capacitance  that  it  can  drive  in  tr  seconds  can  be  determined  theoretically  by 
manipulating  equation  (2.45)  into  the  following  form: 


B  +  E  =  Wn 


(tr)-(KPp{T})-(Vdd-\Vtop{T}\) 
2  •  {L  —  2  •  LD  )  ■  (G  +  H) 


-C  -D 


(2.57) 


The  same  caveat  about  the  variable  A  that  was  applied  to  equation  (2.56) 
is  equally  applicable  to  equation  (2.57). 
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C.  EXTENSION  OF  THE  STATIC  CMOS  INVERTER  TRANSIS¬ 
TOR  SIZING  EQUATIONS  TO  GENERAL  STATIC  CMOS 
LOGIC  DESIGN 

The  static  CMOS  inverter  in  Figures  2.8  and  2.15  involved  only  a  single  p 
MOSFET  in  the  p  block  and  a  single  n  MOSFET  in  the  n  block.  Figure  2.21  shows  a 
compound  gate  that  implements  the  logic  function  F  =  ((A  •  B)  +  (C  ■  D ))  [Weste 
and  Eshraghian,  1985,  p.  15].  There  axe  two  parallel  combinations  of  p  MOSFETs 
in  the  p  block  of  this  compound  gate.  The  two  parallel  combinations  are  in  series 
with  one  another.  A  similar  situation  exists  in  the  n  block.  To  apply  the  equations 
developed  in  Section  B  to  a  problem  like  translating  from  the  schematic  in  Figure 
2.21  to  a  mask  level  layout  requires  that  some  of  the  variables  in  equation  (2.45) 
be  modified. 

Equation  (2.30)  is  the  basis  for  computing  C6dtotal{T}  in  the  static  CMOS 
inverter  of  Section  B.  Section  B.3  states  that  CWtotai  {T}  involves  all  of  the  drain 
regions  hard  wired  to  the  interconnection  node  from  which  the  output  is  taken. 
For  the  static  CMOS  inverter  of  Section  B  this  involves  only  one  drain  in  the  p 
block  and  one  drain  in  the  n  block.  For  a  compound  gate  CWtota]{T}  can  involve 
considerably  more  drain  diffusion  area  than  that.  For  example,  in  Figure  2.21  there 
are  four  drain  regions  wired  to  the  interconnection  node.  If  signals  A  and  C  are 
low  and  signals  B  and  D  are  high  in  Figure  2.21,  the  p  MOSFET  fed  by  signal  A 
will  have  to  drive  the  load  capacitance,  the  capacitance  of  its  own  drain,  and  the 
capacitance  of  the  other  three  drains  wired  to  the  interconnection  node  as  well. 
Therefore,  equation  (2.30)  is  modified  for  general  static  CMOS  logic  design: 


Ct’dtotai  =  #p-Cbdp{T}  +  #n-Cbdn{T} 


(2.5S) 
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Figure  ?.21  Static  CMOS  Compound  Gate  Implementation  of 

F=((A-B)+(C-D)). 
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where 

#p  =  the  toted  number  of  p  drains  attached  to  the  interconnection  node 
between  the  p  and  n  blocks,  each  with  identical  dimensions. 

#n  =  the  total  number  of  n  drains  attached  to  the  interconnection  node 
between  the  n  and  p  blocks,  each  with  identical  dimensions. 

Equation  (2.58)  can  now  be  fully  expanded  according  to  the  same  procedure 

used  to  expand  equation  (2.30)  in  Section  B.3.a.  The  result  is  that: 

Cbdtotal  {T}  =  WpC'  +  D'  [farad]  (2.59) 


where  C'  and  D'  are  no  longer  defined  by  equations  (2.32)  and  (2.33): 

c  =  (#,  •  +  2  •  LO„]  ■  CJ,{T )  ■  CbdhvilJ,{T} 

+  #r-2-CJSWp{T)-CMs„sJT) 

+  [£*„>.  +  2 ■  LD„]  ■  CJ„{T )  ■  Cbdj^JT) 

+  #„  •  A'  -2  ■  CJSWJT)  ■  Chds^.{T})  (2.60) 


where  A'  is  defined  later  in  equation  (2.64). 


2-LD . 


([Lj,  mask.  +2 


LD,]-CJ'{T}-CbdjlT) 


+  2- CJSWJT] -Ctds„tJT]) 

+  2  •  ([i,,.*,  +  2  •  LD,\  +  N,  ■  Lct)  ■  CSJWJT )  •  Cbds^  JT) 
+  K  •  (K  +  2  •  LD,]  ■  CJJT }  •  Cbdj^  JT)  +  IVCI  ■  CmJt ) 


for  x  =  p,  n 

(2.61 ) 

For  each  p  transistor  added  in  series  between  the  supply  voltage  and  the  load 
the  rise  time  at  the  load  increases.  If  a  single  p  MOSFET  has  the  ability  to  drive 
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a  load  high  in  tr  seconds  then  placing  k  identical  p  MOSFETs  in  series  would 
increase  the  rise  time  to  k  ■  tr  seconds.  Similarly,  placing  m  identical  n  MOSFETs 
in  series  between  a  load  and  ground  increases  fall  time  from  t  j  seconds  to  m  ■  t  j 
seconds  [Weste  and  Eshraghian,  1985,  p.  181].  The  reason  for  this  is  quite  apparent 
from  Table  2.1.  Placing  transistors  in  series  is  analogous  to  increasing  the  effective 
channel  length  of  a  single  transistor.  In  Table  2.1  effective  channel  length  is  found 
in  the  denominator  of  the  drain  current  equations.  Therefore  increasing  effective 
channel  length  decreases  drain  current  which  implies  longer  rise  and  fall  times. 

In  Section  B.4  the  variable  F  was  calculated  based  on  equation  (2.15).  Equa¬ 
tion  (2.15)  was  calculated  for  a  static  CMOS  inverter  where  only  a  single  p  MOS- 
FET  stood  between  the  supply  voltage  and  the  load.  As  Figure  2.21  illustrates, 
there  may  in  fact  be  a  long  line  of  p  MOSFETs  in  series  between  the  supply  voltage 
and  the  load.  Therefore  the  effective  channel  length  in  equation  (2.15)  should  be 


modified  for  general  static  CMOS  logic  design: 


*.//,=  £  V,-2-lDr) 


(2.62) 


longest 
path  to  V  dd 


where  the  summation  is  over  the  number  of  p  MOSFETs  that  comprise  the  longest 
path  between  the  supply  voltage  and  the  load  and  Lp  is  assumed  to  be  the  minimum 
mask  level  channel  length  permitted  by  the  technology  in  use. 

The  change  in  equation  (2.15)  is  reflected  in  the  calculation  of  the  variable  F' 
for  general  static  CMOS  logic  design: 


2-  £  (Lp-2  LD„) 


longest 
pfcthto  V dd 


(tr)-(KPtT))-(Vdd-\VtotT)\) 


(2.63) 


The  change  in  effective  channel  length  also  makes  it  necessary  to  modify  the 
variable  A  found  in  equation  (2.19)  for  use  in  general  static  CMOS  logic  design. 
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Applying  the  same  reasoning  that  was  used  to  develop  equation  (2.63)  to  the  cal¬ 
culation  of  the  variable  A  gives: 

52  (L.-1-LdA  .(KP,{T}).(V<W-|V«0,{T}|) 

longest 

_  path  to  GND 

f  52  (lr-2  LDr)]  ■  (KPJT})  ■  (Vdd-VtoJT)) 

■  longest  ■ 


L  longest 
p»th  to  V dd 


<Vton{T}-0.1Vdd)  j 


( Vdd-Vton{T })  2 

( 1  V~<oy  {T}  |— O.l-Wd)  1 

(vd^-|Vtop{T’}|)  2 


Vdd-20VtonfT 1 
Vdd 


Vdd-20  \Vtoa{T) 
Vdd 


(2.64) 


Incorporating  these  modifications  into  equations  (2.45)  and  (2.46)  gives  the 


final  result  for  general  static  CMOS  logic  design: 


w  =  F'-(G  +  H)-(B  +  D'  +  E) 
p  1  -F'-C'-(G  +  H) 

Wn  =  W-  A' 

n  p 


(2.65) 

(2.66) 


The  equations  defining  the  variables  in  equations  (2.65)  and  (2.66)  are  refer¬ 
enced  in  Table  2.5. 

The  Wp  solved  for  is  the  mask  level  width  of  the  channel  of  each  p  MOSFET 
connected  directly  to  the  interconnection  node  between  the  p  and  n  blocks.  This 
width  must  be  maintained  for  all  MOSFETs  between  the  loadmost  p  MOSFETs 
and  the  supply  voltage.  Similarly,  the  Wn  solved  for  is  the  mask  level  width  of 
the  channel  of  each  n  MOSFET  connected  directly  to  the  interconnection  node 
between  the  n  and  p  blocks.  This  width  must  be  maintained  for  all  n  MOSFETs 
between  the  loadmost  n  MOSFETs  and  ground. 
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TABLE  2.5  IMPORTANT  EQUATIONS  FOR 
GENERAL  STATIC  CMOS  LOGIC  DESIGN 


Variable 

Defining  Equations(s) 

Wp 

(2.65) 

K 

(2.66) 

(2.64) 

B 

(2.42) 

and  (2.35) 

C' 

2.60 

D' 

2.61 

E 

2.43 

F' 

2.63 

G 

2.38 

H 

2.39) 

D.  EXTENSION  OF  THE  TRANSISTOR  SIZING  EQUATIONS  TO 
NON-RECTANGULAR  TRANSISTORS. 

To  this  point  the  equations  developed  for  transistor  sizing  have  been  discussed 
only  in  the  context  of  rectangular  shaped  transistors  of  the  sort  depicted  in  Figure 
2.11.  It  is  possible  to  apply  the  result  obtained  to  other  configurations  such  as  the 
Manhattan  geometry  star  in  Figure  2.22  and  the  circular  transistor  in  Figure  2.23. 
The  equations  in  Sections  B  and  C  are  based  on  rectangular  shaped  transistors  but 
only  in  the  sense  that  total  channel  lengths  and  widths  and  total  diffusion  lengths, 
widths,  perimeters  and  areas  were  defined  for  rectangular  areas.  As  long  as  the 
values  assigned  to  these  model  parameters  are  maintained  in  construction  it  is 
possible  to  apply  the  results  for  the  rectangular  configuration  to  any  configuration. 
For  example,  in  Figure  2.23  the  mask  level  channel  length  is  (r2  —  Tj)  and  the  mask 
level  channel  width  is  r2  •  (2  ■  it  —  9).  These  values  should  be  made  to  equal  Lp  and 
Wp  from  the  rectangular  model.  Similarly,  the  drain  area  should  be  made  to  equal 
that  of  the  rectangular  configuration.  In  this  way  the  results  of  Sections  B  and  C 
become  powerful  design  tools. 
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Figure  2.22  Manhattan  Geometry  Star  Configured  MOSFET 


Figure  2.23  Circular  MOSFET 
E.  CHAPTER  SUMMARY 

The  equations  presented  in  this  chapter  enable  one  to  size  transistors  to  meet 
desired  timing  performance  standards  under  known  loading  conditions,  operating 
temperature,  and  supply  voltage.  The  equations  developed  for  the  static  CMOS 
inverter  are  implemented  in  a  computer  program  that  is  included  in  Appendix  A. 
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SUPER  BUFFER  FOR  THE  MONTEREY  SILICON  COMPILER 


A.  BACKGROUND. 

A  need  exists  to  create  and  insert  into  the  Monterey  Silicon  Compiler  code  a 
mask  level  integrated  circuit  design  of  a  static  CMOS  super  buffer  [Malagon,  1987, 
p.  95].  To  understand  the  role  of  a  super  buffer  it  is  necessary  to  first  discuss  the 
circuit  that  it  is  a  part  of.  From  Sedra  and  Smith: 

“A  critical  aspect  of  the  design  of  any  logic  device,  particularly  those 
using  LSI  (Large-Scale  Integration)  and  VLSI  (Very-Large-Scale  Integra¬ 
tion)  circuit  technologies,  is  the  provision  of  clock  signals.  Difficulties 
arise  because  the  clock  signal  (whether  generated  on  or  off  the  chip)  must 
usually  feed  many  parts  of  the  circuit.  This  need  for  large  fan-out  can 
be  met  by  applying  the  clock  signal  to  a  string  of  cascaded  inverters  and 
feeding  the  output  of  each  inverter  to  a  different  part  of  the  circuit,  a 
structure  known  as  a  branching  fan-out  tree.  However,  problems  may 
arise  with  this  approach  as  a  result  of  the  differing  time  delays  that  the 
clock  signal  experiences  on  the  paths  to  the  various  parts  of  the  system. 

To  be  be  specific,  if  two  physically  remote  segments  of  a  logic  network 
must  intercommunicate,  it  is  essential  that  the  relative  time  variation,  or 
skew,  of  their  clocks  be  controlled  and  limited  to  ensure  reliable  operation. 

The  problem  in  VLSI  circuit  design  is  further  compounded  because 
large  clock  fan-out  and  long  clock  lines  imply  large  capacitive  loads  and 
two  conflicting  dangers:  very  slow  rise  and  fall  times  if  drive  current  is 
inadequate  or,  alternatively,  enormous  charging  and  discharging  currents 
if  the  driver  is  too  capable  and  fast  charging.  Thus,  the  design  of  clock 
systems  in  VLSI  circuits  is  a  challenging  problem,  involving  control  of 
signals  that  are  both  large  in  amplitude  and  in  rate  of  rise.”  [Sedra  and 
Smith,  1982,  p.  759] 

In  the  Monterey  Silicon  Compiler  architecture  the  effects  of  skew  are  minimized 
through  the  use  of  a  two  phase  non-overlapping  clocking  scheme  [Mullarky,  19S7. 
pp.  11-23].  The  need  for  large  fan-out  in  the  clock  circuit  is  satisfied  by  the 
super  buffer.  The  super  buffer  is  that  part  of  the  branching  fan-out  tree  that  feeds 
clock  signals  from  the  pad  where  off-chip  generated  clock  signals  arrive,  to  the 
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data  storage  registers  on  the  chip.  Since  the  super  buffer  is  a  gate  in  the  path  of 
incoming  clock  signals,  it  must  be  guaranteed  to  function  within  specified  timing 
constraints  under  given  loading  lest  the  operation  of  an  entire  chip  fail  due  to 
the  inability  of  the  clock  circuit  to  drive  the  on  chip  storage  registers  within  an 
acceptable  amount  of  time  delay.  Since  a  super  buffer’s  sole  function  is  to  increase 
the  drive  of  a  propagating  signal  it  can  be  constructed  by  cascading  two  inverters. 

Professor  Richard  W.  Hamming  of  the  Naval  Postgraduate  School  said  about 
systems  engineering,  “If  you  optimize  one  component  of  a  system  you  will  hate 
yourself  in  the  morning.”  The  idea  behind  the  statement  is  that  each  component  of 
a  system  must  be  designed  to  fit  in  well  with  all  the  other  system  components.  The 
clock  input  pads,  super  buffers,  and  storage  registers  form  a  complete  subsystem 
within  a  Monterey  Silicon  Compiler  generated  circuit.  CMOS  replacements  for  the 
NMOS  clock  input  pads  and  the  NMOS  storage  registers  already  exist,  therefore, 
the  replacement  CMOS  super  buffer  must  be  designed  with  this  previous  work  in 
mind. 

Figure  3.1  demonstrates  the  concept  of  employment  of  super  buffers  within 
the  Monterey  Silicon  Compiler  architecture.  In  the  figure  each  super  buffer  can  be 
seen  to  consist  of  two  cascaded  inverters.  Since  the  clocking  scheme  used  involves 
two  clocks,  two  clock-signals  must  be  delivered  to  each  register.  Thus,  the  super 
buffers  must  be  employed  in  pairs  as  can  be  seen  in  Figure  3.1.  Each  pair  supplies 
both  clock  signals  to  a  bank  of  data  registers  that  straddles  the  data  path. 

In  Figure  3.1  the  inverter  symbols  within  each  super  buffer  are  drawn  so  that 
the  output  stage  inverter  is  larger  than  the  input  stage  inverter.  This  is  intended 
to  be  a  reflection  of  the  physical  construction  necessary  to  achieve  the  desired 
electrical  behavior  of  the  super  buffer.  The  super  buffer  output  stage  must  be 
capable  of  driving  a  large  fan-out.  The  input  stage  must  be  capable  of  driving 
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the  gate  capacitance  of  the  output  stage  and  at  the  same  time  present  an  input 
capacitance  smaller  than  the  output  stage  input  capacitance.  The  MOSFETs  in 
the  output  stage  must,  therefore,  be  larger  than  those  in  the  input  stage. 


The  design  task  is  to  find  all  of  the  dimensions  of  the  input  and  output  stage 
inverters  given  the  following: 

•  worst-case  circuit  operating  temperature 

•  worst-case  supply  voltage 

•  worst-case  gate  voltage 

•  worst-case  SPICE  fabrication  parameters 

The  equations  developed  in  Chapter  II  are  ideally  suited  to  soi  ing  this  prob¬ 
lem.  The  procedure  is  documented  in  the  next  section.  Worst-case  conditions  are 
referred  to  above  because  worst-case  design  is  essential  for  reliable  circuit  opera¬ 
tion.  The  performance  of  the  super  buffer  with  respect  to  time  is  of  the  utmost 
concern  in  this  design.  Therefore,  the  worst-case  parameters  to  be  used  in  the 
design  process  are  those  that  restrict  current  flow  and  thereby  slow  down  the  oper¬ 
ation  of  the  super  buffer.  These  include:  high  circuit  operating  temperatures,  low 
supply  voltage,  low  gate  voltage  (which  is  inherent  to  low  supply  voltage  in  fully 
restored  static  CMOS  logic),  and  minimum  current  SPICE  fabrication  parameters 
(obtained  from  the  silicon  foundry  where  the  circuit  is  to  be  constructed).  A  circuit 
designed  to  function  properly  under  these  punishing  conditions  will  function  faster 
in  a  less  severe  environment.  Of  course,  a  circuit  may  be  designed  to  meet  less 
hostile  conditions  if  it  is  not  necessary  for  the  circuit  to  meet  the  design  perfor¬ 
mance  standards  under  such  extreme  circumstances.  In  that  case,  nominal  values 
for  the  supply  voltage  and  SPICE  fabrication  parameters  might  be  used  in  die 
design  specifications  as  well  as  some  lesser  operating  temperature. 
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In  the  design  of  microchips  intended  for  commercial  use,  semiconductor  man¬ 
ufacturers  typically  guarantee  their  devices  for  operation  over  a  temperature  range 
of  —40°  C  to  +85°  C  and  over  a  supply  voltage  range  of  4.5  V  to  6  V.  [Signetics 
Corp.,  1986,  p.  5-4].  For  military  applications  both  of  these  ranges  axe  broader. 
In  keeping  with  industrial  practice,  the  super  buffer  is  designed  for  guaranteed 
operation  within  a  specified  timing  constraint  at  85°  C  (358.15  K)  with  a  supply 
voltage  (and  gate  voltage)  of  4.5  V.  The  minimum  current  SPICE  fabrication  pa¬ 
rameters  chosen  are  from  the  Metal-Oxide-Semiconductor  Implementation  Service 
(MOSIS)  sponsored  by  DARPA  at  the  Information  Sciences  Institute,  University 
of  Southern  California.  The  parameters  are  included  in  Appendix  B.  The  desired 
rise  time  and  fall  time  at  the  output  of  each  stage  of  the  super  buffer  is  chosen  to  be 
2.0  nanoseconds.  The  total  average  delay  time  desired  is  therefore  (from  equation 
(2.16)): 


=  td  -f- 1. 

»vg, input  stage  “avg, output  stage 

=  4-(t,  +t,  )  +  i(<r  +t,  ) 

\  input  stage  'input  stage  /  \  output  stage  /  output  stage  / 

=  f(2.0x  10  "9 +2. Ox  10"9)+i-(2.0xl0_9+2.0x  10 -9) 

=  2.0  x  10-9  [s] 


B.  DESIGN  OF  THE  SUPER  BUFFER. 

The  number  of  registers  across  the  width  of  the  data  path  in  Figure  3.1  is  equal 
to  the  number  of  bits  in  the  word  that  propagates  along  the  path.  In  order  to  deter¬ 
mine  the  loading  that  the  super  buffer  must  be  designed  to  support,  an  upper  limit 
on  the  number  of  registers  that  can  be  supported  under  a  worst-case  scenario  must 
be  chosen.  As  this  number  becomes  larger,  the  size  of  the  super  buffer  required  also 
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becomes  larger.  Due  to  the  size  constraint  imposed  by  the  physical  layout  of  all 
floorplans  designed  by  the  Monterey  Silicon  Compiler,  the  number  of  supportable 
registers  under  a  worst-case  scenario  is  chosen  to  be  four.  For  a  number  larger  than 
that  the  static  CMOS  super  buffer  would  have  to  be  significantly  larger  than  the 
NMOS  super  buffer  that  it  replaces  and  it  simply  would  not  fit  within  the  space 
presently  allotted  for  it  on  the  floorplan.  This,  however,  is  the  number  of  registers 
that  can  be  driven  in  2  nanoseconds  if  the  worst-case  scenario  is  assumed.  As  dis¬ 
cussed  in  Section  A,  the  worst-case  scenario  implies  that  the  circuit  is  constructed 
using  worst-case  minimum  current  SPICE  fabrication  parameters,  that  the  supply 
voltage  is  only  4.5  volts,  and  that  the  circuit  operates  at  85°  C.  Since  the  super 
buffer  is  designed  under  a  worst-case  assumption,  operating  it  under  less  hostile 
conditions  will  enhance  its  performance.  If  the  capacitive  loading  on  the  super 
buffer  remains  constant  but  operating  conditions  improve,  the  super  buffer  will 
function  faster  than  the  2  nanosecond  delay  time  upper  limit.  The  actual  speed 
can  be  calculated  using  equation  (2.56).  Conversely,  improved  operating  conditions 
offer  the  opportunity  to  drive  more  capacitance  (more  registers)  without  affecting 
the  speed  of  operation  of  the  super  buffer.  If  the  worst-case  scenario  does  occur 
and  the  data  path  is  wider  than  four  bits  then  the  super  buffer  will,  of  course,  still 
function,  but  it  will  no  longer  meet  the  2  nanosecond  design  goal.  The  performance 
of  the  super  buffer  under  the  loading  caused  by  four,  eight,  sixteen,  and  thirty-two 
bit  data  paths  is  discussed  in  Section  E.  Thus,  the  Monterey  Silicon  Compiler  is 
not  by  any  means  restricted  to  the  construction  of  four  bit  circuits.  As  with  any 
integrated  circuit,  the  maximum  clock  frequency  at  which  the  super  buffers  can 
operate  is  governed  by  temperature,  capacitive  loading,  supply  voltage,  and  fabri¬ 
cation  process.  The  better  these  conditions  are,  the  faster  the  super  buffers  will 
operate;  the  worse  they  are,  the  slower  the  super  buffers  will  operate. 


Figure  3.2  shows  the  mask  level  layout  of  a  single  data  register  [Mullarky,  1987, 
p.  56].  The  labels  PHIl!  and  PHI2!  on  Figure  3.2  correspond  to  the  phil  and  phi2 
clocklines  in  Figure  3.1.  The  output  stage  of  one  super  buffer  of  a  super  buffer  pair 
is  connected  to  the  input  labeled  PHIl!  in  Figure  3.2.  The  output  stage  of  the 
other  super  buffer  in  the  pair  is  connected  to  the  input  labeled  PHI2!  in  Figure 
3.2.  Looking  into  the  register  at  PHIl!  the  super  buffer  sees  three  n  MOSFETs 
and  a  p  MOSFET,  all  in  parallel  and  each  having  mask  level  channel  dimensions 
( L  x  W)  equal  to  3.0  microns  by  4.5  microns. 

The  programs  in  Appendix  A  are  used  to  calculate  the  gate  capacitance  seen 
by  the  super  buffer  looking  into  the  PHIl!  input  of  the  register.  To  run  the 
programs  on  the  HP41CX  they  must  all  first  be  stored  in  extended  memory  and 
main  memory  must  be  cleared.  The  name  “VLSI”  is  stored  in  the  alpha  register 
and  execution  continues  as  follows: 

GETP  then  R/S  are  executed.  The  program  responds: 

STOP.  PURGE  MAIN  MEMORY 

The  programs  axe  so  large  that  they  cannot  all  be  held  in  main  memory  at 
once.  They  must  be  swapped  in  and  out  of  extended  memory  as  needed.  The 
prompt  is  saying  that  the  program  VLSI  should  be  the  only  program  currently  in 
main  memory.  If  it  is  not,  all  programs  in  main  including  VLSI  must  be  purged 
and  the  process  started  over  again.  R/S  is  executed  again.  The  program  responds: 

ENTER  FAB.  PARAM.  CARD 

The  programs  require  that  the  SPICE  fabrication  parameters  be  entered. 
These  should  be  stored  on  magnetic  cards  in  accordance  with  the  data  register 
allocation  detailed  in  Appendix  A.  The  cards  are  entered  and  the  program  re¬ 
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The  response  for  worst-case  operating  temperature  is  358.15.  R/S  is  executed. 
The  program  responds: 

VBS  (  N  ),  VOLTS  =  ? 

Since  a  p+  substrate  contact  will  be  placed  on  the  actual  device  so  that  it 
abuts  the  n  source  diffusion  at  the  ground  line  contact,  Vbsn  is  equal  to  zero.  R/S 
is  executed.  The  program  responds: 
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VBS  {  P  ),  VOLTS  =  ? 

Since  an  n+  substrate  contact  will  be  placed  on  the  actual  device  so  that  it 
abuts  the  p  source  diffusion  at  the  supply  voltage  line  contact,  Vbsp  is  equal  to 
zero.  R/S  is  executed.  The  program  responds: 

SUPPLY  VOLTAGE  (  VOLTS  )  =  ? 

The  response  for  worst-case  supply  voltage  is  4.5.  R/S  is  executed.  The 
program  responds: 

P  CHNL  MASK  LNGTH  (  MTR  )  =  ? 

At  the  end  of  Section  B.2  in  Chapter  II  it  is  stated  that  the  channel  mask 
length  should  be  chosen  to  be  the  minimum  feature  size  for  the  technology  being 
used.  Since  the  technology  being  used  in  the  Monterey  Silicon  Compiler  is  based  on 
a  p  well  process  with  three  micron  minimum  feature  size  the  response  is  3.0  x  10-6 . 
R/S  is  executed.  The  program  responds: 

N  CHNL  MASK  LNGTH  (  MTR  )  =  ? 

Again,  the  response  is  3  x  10~6.  R/S  is  executed.  The  program  responds  with 
a  menu: 


WI  LD 

Choosing  WI  would  go  into  the  transistor  sizing  algorithm.  Since  the  immediate 
goal  is  to  find  the  gate  capacitance  seen  by  the  super  buffer  looking  into  the  PHI  1 ! 
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input  of  the  register,  LD  is  chosen,  where  LD  is  short  for  “load”.  The  program 
responds  with  an  introductory  line  immediately  followed  by  a  query: 

LOADCAP 

LOAD  CHNL  MASK  LNGTH  (  MTR  }  ? 

The  total  gate  capacitance  looking  in  at  PHIl!  of  the  register  is  calculated 
starting  with  the  three  n  MOSFETs.  The  length  for  all  three  is  3.0  microns.  R/S 
is  executed.  The  program  responds: 

LOAD  CHNL  MASK  WIDTH  {  MTR  )  ? 

The  width  for  all  three  n  MOSFETs  is  4.5  microns.  R/S  is  executed.  The 
program  responds  with  a  query  followed  by  a  menu: 

DEVICE  TYPE? 

N  P 

N  is  chosen  to  indicate  to  the  program  that  the  device  is  an  n  MOSFET. 

The  program  responds  with  another  query  followed  by  another  menu: 

ANOTHER  DEVICE? 

Y  N 

The  response  is  Y  for  yes.  The  program  responds. 

LOAD  CHNL  MASK  LNGTH  (  MTR  )  ? 

The  same  procedure  is  continued  by  responding  to  the  prompts  as  appropriate 
until  all  three  n  MOSFETs  and  the  p  MOSFET  have  been  entered.  At  that  point 
the  response  to  the  query,  “ANOTHER  DEVICE”,  is  N  for  no,  and  the  program 
responds  with  the  total  capacitance  of  the  four  MOSFETs  connected  in  parallel: 

C  E  LOAD  DEV  =  43.43E  -  15  (  FARAD  ) 

R/S  is  executed  and  the  menu  that  gives  a  choice  between  the  transistor  sizing 
algorithm  and  the  load  capacitance  algorithm  appears  again.  Looking  into  the 
register  at  PHI2!  the  other  super  buffer  in  the  pair  sees  three  p  MOSFETs  and  an 


n  MOSFET,  all  in  parallel  and  each  having  mask  level  channel  dimensions  (L  x  W) 
equal  to  3.0  microns  by  4.5  microns.  In  terms  of  the  number  of  each  type  of  device 
this  is  the  opposite  of  what  is  seen  by  the  other  super  buffer.  This  leads  to  a  slight 
difference  between  the  gate  capacitance  seen  by  each  super  buffer  of  the  pair.  The 
load  capacitance  algorithm  determines  the  gate  capacitance  seen  looking  into  the 
register  at  PHI2!  to  be  39.5  femtofarads.  The  difference  between  the  capacitance 
seen  by  the  two  loads  is  not  so  great  as  to  warrant  the  construction  of  two  entirely 
different  super  buffers  to  make  up  the  pair.  It  does  however  suggest  that  if  only  one 
basic  super  buffer  is  to  be  designed  and  employed  in  pairs  then  the  basic  super  buffer 
should  be  designed  to  support  the  larger  of  the  two  loads,  43.43  femtofarads.  For 
each  super  buffer  to  drive  four  registers  requires  that  it  be  capable  of  driving  four 
times  this  amount.  Thus,  C ^gate  for  each  super  buffer  =  (4)  •  (43.43  x  10~15 )  = 
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173.7  x  10~15  Farads.  There  is  one  other  element  of  capacitive  loading  not  yet 
determined  that  is  required  in  the  calculations;  the  routing  capacitance.  For  a 
four  bit  data  path  the  Monterey  Silicon  Compiler  typically  places  the  registers  so 
that  the  metal  line  connecting  the  clock  input  on  each  register  to  the  super  buffer 
output  is  187.5  microns  long  per  register  and  4.5  microns  wide.  Thus,  for  a  four 
bit  data  path  the  routing  capacitance  of  the  line  extending  from  each  super  buffer 
in  the  pair  to  the  four  registers  it  supports  is: 

Cr  =  (4)  •  (4.5  x  10~6)  ■  (187.5  x  10~6)  •  Cmd 

where  Cmd  is  the  capacitance  formed  between  metal  and  substrate.  Cmd  is  taken 
to  be  100  x  10-6  [VVeste  and  Eshraghian,  1985,  p.  135].  Substituting  this 
value  for  Cmd  gives: 


Cr  =  337.5  femtofarads 


Armed  with  the  knowledge  that  C  gate  =  173.7  x  10  15  Farads,  and  that 
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Cr  =  337.5  x  10“ 15  Farads  for  each  super  buffer  the  transistor  sizing  algorithm 
may  now  be  used  to  determine  the  dimensions  of  the  p  and  n  MOSFETs  in  the 
output  stage  of  the  super  buffer. 

Returning  once  again  to  the  menu  that  gives  a  choice  between  the  transistor 
sizing  algorithm  and  the  load  capacitance  algorithm  the  choice  “WI”  is  selected. 
The  program  responds  with  a  query  followed  by  a  menu: 

DO  YOU  KNOW  C  E  LOAD  DEV? 

YES  NO 

Since  CD  LOAD  DEV  (=  C  ^gate)  has  already  been  calculated  the  response 
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is  YES.  The  program  responds: 

LOAD  DEV  (  FARAD  )  =  ? 

The  response  is  173.7  x  10-15.  R/S  is  executed  and  the  program  responds: 
DESIRED  RISE  TIME  {  SEC  }  =  ? 

The  time  that  has  been  chosen  is  2  x  10~9  seconds.  R/S  is  executed.  The 
program  responds: 

ROUTING  CAP  (  FARAD  )  =  ? 

Cr  has  been  calculated  to  be  337.5  x  10— 15  Farads.  It  is  interesting  to  note 
that  this  routing  capacitance  is  about  double  the  gate  capacitance  of  the  load.  R/S 
is  executed.  The  program  responds: 

P  DRAIN  MASK  LNGTH  (  MTR)  ? 

The  minimum  drain  length  for  the  technology  is  selected  in  order  to  reduce 
drain  diffusion  capacitance  to  a  minimum.  The  value  is  3  x  10-6.  R/S  is  executed. 
The  program  responds: 

N  DRAIN  MASK  LNGTH  (  MTR  )  ? 
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As  with  the  p  drain  diffusion  length,  the  value  selected  is  the  minimum  per¬ 
mitted.  The  value  is  3  x  10~6.  R/S  is  executed.  The  program  responds: 

NO.  P  DIFFUSION  CONTACTS 

The  program  needs  to  know  how  many  diffusion  to  metal  contacts  are  going  to 
be  placed  around  the  p  drain  diffusion  area.  The  choice  is  three.  R/S  is  executed 
and  the  program  responds: 

P  DIFCTC  MASK  LNGTH  {  MTR  )  ? 

The  value  requested  is  the  length  of  the  diffusion  to  metal  contacts  placed 
around  the  p  drain  diffusion.  To  minimize  drain  capacitance  the  minimum  dimen¬ 
sion  supported  by  the  technology  is  chosen.  The  value  is  6  x  10~6.  This  is  the 
same  value  for  the  length  of  the  diffusion  to  metal  contacts  placed  around  the  n 
dram  diffusion  els  well  as  for  the  width  of  the  contacts  placed  around  both  drain 
diffusions.  R/S  is  executed  and  the  program  responds: 

PDIFCTC  MASK  WIDTH  (  MTR  )  ? 

The  response  is  6  x  10“6.  R/S  is  executed  and  the  program  responds: 

NO.  N  DIFFUSION  CONTACTS 

One  diffusion  to  metal  contact  will  be  placed  on  the  n  drain  diffusion  area. 
R/S  is  executed.  The  program  responds: 

N  DIFCTC  MASK  LNGTH  {  MTR  )  ? 

The  response  is  6  X  10-6.  R/S  is  executed.  The  program  responds: 

N  DIFCTC  MASK  WIDTH  {  MTR  )  ? 

The  response  is  6  x  10~6.  R/S  is  executed.  The  program  responds,  with  all 
the  dimensions  (as  defined  in  Table  2.2)  of  the  n  and  p  MOSFETs  in  the  output 
stage  of  the  super  buffer  that  are  required  for  a  complete  modeling  of  the  output 
stage  with  PSpice: 
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L  <  P  )  =  3  x  10"6  {  MTR  ) 

W  {  P  )  =  134.3  x  1(T6  {  MTR  ) 

AD  {  P  )  =  660.9  x  10"12  {  SQ.  MTR  ) 

PD  (  P  )  =  314.4  x  10~6  {  MTR  ) 

RD  (  P  )  =22.16  {  OHM  ) 

L  {  N  )  =  3  x  10"6  {  MTR  ) 

W  {  N  )  =  51.93  x  10"6  (  MTR  } 

AD  (  N  )  =  231.2  x  10"12  (  SQ.  MTR  ) 

PD  {  N  >  =  124.4  x  10"6  (  MTR  ) 

RD  (  N  )  =  29.19  {  OHM  ) 

The  program  then  returns  to  the  very  beginning.  The  program  is  run  again 
in  its  entirety  to  determine  the  dimensions  of  the  n  and  p  MOSFETs  in  the  input 
stage  of  the  super  buffer.  C^gate  for  the  input  stage  is  that  capacitance  that  is 
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due  to  the  gates  of  the  n  and  p  MOSFETs  of  the  output  stage.  CT  is  negligible  for 
the  input  stage  as  the  extent  of  the  routing  between  the  input  and  output  stages  is 
minimal.  The  drain  diffusion  length  and  contact  dimensions  are  again  are  chosen  to 
be  the  minimum  supportable  by  the  technology  to  minimize  the  capacitance  that 
must  be  driven.  Two  metal  to  diffusion  contacts  axe  chosen  for  the  p  drain  diffusion 
area  and  one  is  chosen  for  the  n  drain  diffusion  area.  The  results  of  factoring  these 
variables  into  the  program  in  combination  with  the  worst-case  temperature,  supply 
voltage,  and  fabrication  parameters  are  all  the  dimensions  (as  defined  in  Table  2.2) 
of  the  n  and  p  MOSFETs  in  the  input  stage  of  the  super  buffer  that  are  required 
for  a  complete  modeling  of  the  input  stage  with  PSpice: 

L  (  P  )  =  3  x  10~6  (  MTR  ) 

W  (  P  )  =  106.1  x  lO"6  (  MTR  ) 


AD  {  P  >  =  507.3  x  10”12  {  SQ.  MTR  ) 
PD  {  P  )  =  246  x  10"8  <  MTR  ) 

RD  {  P  )  =  32.76  (  OHM  } 

L  {  N  )  =  3  x  10-6  {  MTR  ) 

W  {  N  )  =  41.01  x  10~8  {  MTR  ) 
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Figure  3.3  Schematic  of  Static  CMOS  Super  Buffer 
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A  schematic  drawing  of  the  super  buffer  with  all  the  calculated  dimensions 
affixed  is  presented  in  Figure  3.3.  The  load  capacitance  on  the  output  stage  is 
merely  the  sum  of  C  gate  +  Cr  calculated  for  the  output  stage.  Note  that  node 

lo  fcd 
devices 

numbers  (0,1,  etc.)  and  transistor  names  (Ml,  M2,  etc.)  have  been  assigned  in  the 
figure.  These  node  numbers  and  transistor  names  correspond  to  the  nodes  used 
in  the  actual  PSpice  deck  as  it  appears,  ready  for  simulation,  in  Appendix  B.  The 
worst-case,  minimum  current  SPICE  fabrication  parameters  are  also  included  in 
the  PSpice  deck  in  Appendix  B. 

C.  SIMULATION  OF  THE  SUPER  BUFFER. 

The  results  of  simulating  the  PSpice  deck  in  Appendix  B  are  displayed  in 
Figures  3.4,  3.5,  and  3.6.  In  each  figure  voltage  is  plotted  as  a  function  of  time. 
The  voltages  displayed  are  referred  to  by  node  numbers  which  correspond  to  those 
used  in  Figure  3.3  and  Appendix  B.  Voltages  are  measured  with  respect  to  two 
nodes.  Where  only  one  node  is  listed,  the  second  is  taken  to  be  node  0,  which  in 
the  case  of  this  circuit  is  ground.  Thus,  Figure  3.4  shows  the  voltage  waveforms 
presented  to  and  produced  by  the  input  stage  of  the  super  buffer.  The  value  of  1.0 
volt  that  is  subtracted  from  the  input  voltage  wave  form  is  the  threshold  voltage 
of  the  n  device.  Similarly  the  value  of  j  —  1.0|  volt  that  is  subtracted  from  the 
absolute  value  of  V(2,l)  is  the  absolute  value  of  the  threshold  voltage  of  the  p 
device.  These  waveforms  are  included  to  assist  in  determining  when  the  devices 
are  in  their  various  regions  of  operation:  linear,  saturation,  and  cutoff  as  discussed 
in  Chapter  II. 

The  simulation  result  in  Figure  3.4  is  almost  identical  to  that  predicted  by 
theory  in  Figures  2.9a  and  2.10a  (the  small  spikes  are  due  to  anomalies  of  the 
model).  This  is  an  important  result  because  the  model  used  to  prepare  Figures 
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Figure  3.4  PS  pice  PROBE  Postprocessor  Simulation  of  Voltage  Wave¬ 
forms  Presented  to  and  Produced  by  the  Input  Stage  of  the 
Super  Buffer  Using  MOSFET  Level  Two  Model 
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2.9a  and  2.10a  was  based  on  the  Shichman- Hodges  model  associated  with  level  one 
SPICE  MOSFET  simulations  while  the  model  used  in  the  simulation  that  resulted 
in  Figures  3.4,  3.5,  and  3.6  was  a  different  one.  The  foundation  of  the  equations 
developed  in  Chapter  II  rests  on  the  Shichman-Hodges  model  of  MOSFET  behav¬ 
ior.  To  confirm  the  validity  of  the  approach  taken  in  developing  the  equations  of 
Chapter  II  a  totally  independent  model  was  selected  for  simulation;  the  level  two 
SPICE  MOSFET  model  based  on  the  work  of  Vladimirescu  and  Liu  [Electronics 
Research  Laboratory,  1980,  pp.  1-23]. 

Figure  3.5  shows  the  voltage  waveforms  presented  to  and  produced  by  the 
output  stage  of  the  super  buffer.  Figure  3.6  shows  the  voltage  waveform  presented 
to  the  input  stage  of  the  super  buffer  and  the  voltage  waveform  produced  by  the 
output  stage  of  the  super  buffer. 

Table  3.1  summarizes  the  rise  and  fall  time  data  presented  in  Figures  3.4, 
3.5,  and  3.6.  Since  the  performance  target  is  that  tr  and  t f  for  the  output  voltage 
waveforms  of  both  the  input  stage  and  the  output  stage  not  exceed  2  x  10~9  seconds 
the  table  shows  that  success  has  been  achieved.  The  rise  time  of  the  voltage 
waveform  produced  by  the  input  stage  is  12%  faster  than  the  2  nanosecond  goal. 
The  fall  time  of  this  voltage  waveform  is  37%  faster  than  the  goal.  The  rise  time 
of  the  voltage  waveform  produced  by  the  output  stage  is  6%  faster  than  the  two 
nanosecond  goal.  The  fall  time  of  this  waveform  is  19.5%  faster  than  the  goal. 
Rough  symmetry  is  achieved  in  rise  and  fall  times  at  the  output  of  each  stage. 

Delay  time  based  on  the  projected  rise  and  fall  times  should  be  1  nanosecond 
for  each  stage  or  2  nanoseconds  for  both  stages  in  cascade  (based  on  equation 
(2.16)).  Table  3.2  summarizes  the  delay  time  data  of  Figures  3.4,  3.5,  and  3.6. 
Table  3.2  shows  that  the  average  time  delay  through  the  input  stage  is  25%  faster 
than  predicted.  The  average  time  delay  through  the  output  stage  is  27%  slower 
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Figure  3.5  PSpice  PROBE  Postprocessor  Simulation  of  Voltage  Wave¬ 
forms  Presented  to  and  Produced  by  the  Output  Stage  of 
the  Super  Buffer  Using  MOSFET  Level  Two  Model. 
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Figure  3.6  PSpice  PROBE  Postprocessor  Simulation  of  Voltage  Wave¬ 
forms  Presented  to  the  Input  Stage  of  the  Super  Buffer  and 
Produced  by  the  Output  Stage  of  the  Super  Buffer  Using 
MOSFET  Level  Two  Model. 
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TABLE  3.1  SUMMARY  OF  RISE  AND  FALL 
TIME  DATA  FOR  THE  SUPER  BUFFER 


voltage  waveform  at  input 
to  input  stage 

voltage  waveform  at  output 
of  input  stage  (same  as 
at  input  to  the  output  stage 

voltage  waveform  at 
output  of  output  stage 

than  predicted.  Finally,  the  average  time  delay  through  the  entire  super  buffer  is 
within  1%  of  the  projected  value  of  two  nanoseconds.  Note  that  in  Table  3.2  the 
sum  of  the  column  entries  do  not  sum  exactly  to  the  values  in  the  third  row.  This 
is  because  all  the  data  in  the  table  is  taken  directly  from  discrete  data  points  that 
are  used  to  draw  the  voltage  waveforms  in  Figures  3.4,  3.5,  and  3.6.  Some  linear 
interpolation  is  necessary  in  order  to  obtain  the  time  at  the  50%  voltage  levels. 
This  accounts  for  the  slight  differences  between  the  sums  of  the  column  entries  and 
the  third  row  entries. 

Thus  the  correct  timing  is  achieved  and  the  validity  of  the  equations  in  Chap¬ 
ter  II  is  borne  out  by  simulation  with  an  independent  model. 

The  low  noise  margin  and  high  noise  margin  for  both  stages  givi-v.  a  5  volt 
supply  voltage  is  calculated  to  be  2.6  volts  and  1.7  volts,  respectively  [Weste  ic 
Eshraghian,  1985,  pp.  507-508]. 

D.  MASK  LEVEL  SUPER  BUFFER  IMPLEMENTATION 

A  mask  level  integrated  circuit  layout  of  the  super  buffer  based  on  the  di¬ 
mensions  calculated  is  shown  in  Figure  3.7.  To  reduce  the  possibility  of  latch-up. 


tr(seconds)  if  (seconds) 


8  x  10”12 

10  x  10“12 

1.76  x  10“9 

1.259  x  10"9 

1.877  x  10"9 

1.61  x  10"9 

TABLE  3.2  SUMMARY  OF  DELAY  TIME 
DATA  FOR  THE  SUPER  BUFFER 


tdT{seconds) 

td{  (seconds) 

"4"  ^ d  ^ 

^rfavg  2 

delay  through 
input  stage 

0.72  x  10"9 

0.773  x  10~9 

0.746  x  10“9 

delay  through 
output  stage 

1.093  x  10"9 

1.45  x  10~9 

1.272  x  10“9 

delay  through  input 
and  output  stages 
in  cascade 

1.866  x  10"9 

2.17  x  10~9 

2.018  x  10~9 

substrate  contacts  are  employed  in  the  n+  and  p+  source  diffusion  regions.  To 
minimize  the  chip  area  of  the  super  buffer  requires  that  the  transistors  be  laid  out 
in  serpentine  patterns.  As  a  final  check  of  the  circuit’s  functionality,  the  mask  level 
design  is  simulated  with  the  ESIM  gate  level  simulator.  The  simulation  indicates 
that  the  mask  layout  is  logically  correct. 

To  use  the  super  buffer  layout  in  the  automated  generation  of  custom  mi¬ 
crochips  with  the  Monterey  Silicon  Compiler,  the  Caltech  Intermediate  Form  (CIF) 
representation  of  the  circuit  is  translated  into  an  L5  form  representation  using  a 
CIF  to  Lo  conversion  program  created  by  E.  Malagon  [Malagon,  1987,  p.  110]. 
CIF  and  L5  are  two  different  languages  used  to  represent  the  geometric  shapes 
that  comprise  a  mask  level  layout.  L5  stands  for  Lincoln  Laboratory  Lisp-based 
Layout  Language.  The  language  was  created  at  the  Massachusetts  Institute  of 
Technology  Lincoln  Laboratory  under  the  sponsorship  of  the  US  Defense  Advanced 
Research  Projects  Agency.  The  Monterey  Silicon  Compiler  requires  that  the  mask 
level  integrated  circuit  layout  be  represented  in  L5  form  before  installing  the  layout 
representation  into  the  compiler  code.  After  the  design  is  translated  from  CIF  to 
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L5  it  is  inserted  into  the  compiler  code  within  the  program  data-path.l  using  a  pro¬ 
cedure  proposed  by  Baumstarck  [Baumstarck,  1987,  pp.  70-81].  Some  additional 
wiring  is  added  in  L5  code  to  ensure  that  the  super  buffer  cell  is  correctly  placed 
on  the  floorplan  each  time  it  is  called  by  the  compiler  and  to  ensure  that  signals 
are  properly  routed  to  and  from  the  super  buffer. 

An  example  of  a  portion  of  a  microchip  designed  using  the  replacement  static 
CMOS  super  buffer  is  presented  in  Figure  3.8.  In  the  figure  the  horizontal  rails  at 
the  center  of  the  figure  axe  the  supply  voltage,  ground,  and  clock  distribution  rails. 
The  outputs  at  the  top  of  the  two  super  buffers  that  make  up  the  super  buffer  pair 
that  straddles  the  rails  tire  wired  to  PHIl!  and  PHI2!  of  the  register  directly  above 
it  in  the  data  path. 

Figure  3.9  shows  an  entire  4  bit  shifter  microchip  designed  with  the  Monterey 
Silicon  Compiler  using  the  static  CMOS  super  buffer  and  CMOS  register.  This 
chip  is  in  fact  a  hybrid  chip  in  that  both  NMOS  and  CMOS  technologies  are 
incorporated  in  the  chip.  The  input  and  output  pads  around  the  outer  ring  of  the 
circuit  are  NMOS.  A  collection  of  CMOS  pads  for  the  Monterey  Silicon  Compiler 
has  been  obtained  but  has  not  yet  been  translated  into  L5  and  inserted  into  the 
compiler  code.  The  CMOS  input  pad  of  this  collection  is  the  one  that  has  been 
referred  to  in  this  chapter.  It  is  presented  in  Figure  3.10. 

E.  PERFORMANCE  OF  THE  SUPER  BUFFER  IN  MULTI-BIT  DATA 
PATH  CIRCUITS 

Section  C  documents  the  performance  of  the  super  buffer  under  the  wc-st-case 
scenario  given  the  loading  caused  by  a  four  bit  data  path.  In  section  B  it  is  stated 
that  improving  the  operating  conditions  or  the  fabrication  conditions  of  the  super 
buffer  or  both  results  in  enhanced  performance.  Using  equation  (2.56)  the  per¬ 
formance  of  the  super  buffer  can  be  predicted  under  different  loading  conditions. 
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Figure  3.8  Example  of  a  Portion  of  a  Microchip  Designed  with  the 
Monterey  Silicon  Compiler  Showing  Placement  of  the  Static 
CMOS  Super  Buffer  Below  a  CMOS  Register. 
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Table  3.3  contains  entries  generated  from  equation  (2.56)  that  predict  the  perfor¬ 
mance  of  the  super  buffer  with  the  same  operating  temperature  and  fabrication 
process  as  under  the  worst-case  scenario  only  the  supply  voltage  has  been  raised 
from  4.5  volts  to  6.0  volts.  As  can  be  seen  in  the  table  the  effect  of  simply  raising 
the  supply  voltage  by  1.5  volts  allows  the  super  buffer  to  drive  an  eight  bit  data 
path  to  within  3%  of  the  2  nanosecond  worst-case  design  goal. 

TABLE  3.3  SUMMARY  OF  RISE  AND  FALL  TIME  DATA  FOR 
THE  SUPER  BUFFER  IN  MULTI-BIT  DATA  PATH  CIRCUITS 

WITH  VDD  =  6  VOLTS 


tr(=tf) 
for  4  bits 
seconds 

*r(=tf) 

for  8  bits 
seconds 

tr(=tf) 
for  16  bits 
seconds 

<r(=f/) 
for  32  bits 
seconds 

voltage  waveform 
at  input  to 
input  stage 
(ideal  square  wave) 

0 

0 

0 

0 

voltage  waveform 
at  output  of 
input  stage  (same 
as  at  input  to  the 
output  stage) 

1.652  x  10~9 

1.652  x  10"9 

1.652  x  10“9 

1.652  x  10~9 

voltage  waveform 
at  output  of 
output  stage 

1.5S6  x  IQ"9 

2.054  x  10"9 

2.99  x  10"9 

4.862  x  10"9 

Table  3.4  contains  the  predicted  delay  time  data  for  the  super  buffer  under  the 
same  conditions  as  discussed  above.  The  values  are  generated  by  equation  (2.17) 
using  the  entries  in  Table  3.3.  Here  too,  the  super  buffer  is  seen  to  be  capable  of 
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driving  an  eight  bit  data  path  in  2  nanoseconds  instead  of  only  four  bits.  This  is 
due  simply  to  the  increase  in  the  supply  voltage.  The  table  shows  that  even  for 
a  32  bit  data  path  the  average  delay  time  through  the  super  buffer  is  still  a  very 
respectable  3.26  nanoseconds 

TABLE  3.4  SUMMARY  OF  DELAY  TIME  DATA  FOR  THE 
SUPER  BUFFER  IN  MULTI-BIT  DATA  PATH  CIRCUITS 


WITH  VDD  =  6  VOLTS 


delay  through 
input  stage 


“*vg 

for  4  bits 
seconds 


0.826  x  10- 


for  8  bits 
seconds 


0.826  x  10" 


t.  for 
for  16  bits 
seconds 


0.826  x  10' 


for  32  bits 
seconds 


0.826  x  10- 


delay  through 

output  stage  0.793  x  10~9  1.027  x  10~s  1.495  x  10~9  2.431  x  10~9 


delay  through 

input  and  output  1.619  x  10-9 
stages  in  cascade 


1.853  x  10~9  2.321  x  10~9  3.257  x  10"9 


Figure  3.11  shows  an  example  of  an  eight  bit  microchip  design  for  a  taxi  meter 
controller  generated  using  the  Monterey  Silicon  Compiler  with  the  static  CMOS 
super  buffer  in  place  [Massachusetts  Institute  of  Technology  19S2  Conference  on 
Advanced  Research  in  VLSI,  1982,  p.  32].  Tables  3.3  and  3.4  demonstrate  that 
by  improving  the  supply  voltage  the  drive  of  the  super  buffer  doubles.  Similar 
improvement  in  the  drive  of  the  super  buffer  can  be  realized  through  a  fabrication 
process  that  is  better  than  the  worst-case  process  presumed  in  the  design  work 
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and  through  circuit  operating  temperatures  that  are  lower  than  the  worst-case 
temperature  of  85°  C.  The  amount  of  delay  that  is  tolerable  in  the  super  buffer  is 
largely  application  dependent. 


IV.  CONCLUSIONS 


A.  SUMMARY 

This  research  set  out  to  design  a  high-speed  static  CMOS  super  buffer  for  the 
Monterey  Silicon  Compiler.  The  problem  of  choosing  its  dimensions  to  achieve  a 
desired  timing  result  under  worst-case  conditions  was  approached  analytically,  and 
an  algorithm  was  derived  to  solve  the  problem.  The  algorithm  was  implemented 
in  a  computer  program  which  was  used  as  a  computer-aided-design  (C.“.0 )  tool 
to  design  a  hardware  device.  In  SPICE  simulations  the  resulting  device  surpassed 
the  pre-specified  timing  performance  standards  under  the  worst-case  scenario  that 
it  was  designed  to  meet.  Finally,  a  mask  level  integrated  circuit  design  of  this 
device  was  installed  in  the  Monterey  Silicon  Compiler  as  a  part  of  the  technology 
upgrade  of  the  MacPITTS  Silicon  Compiler,  where  it  is  now  available  for  use  in 
the  automated  design  of  custom  hybrid  CMOS/NMOS  VLSI  microchips. 

The  ability  to  use  an  algorithm  to  size  transistors  on  a  chip  is  an  important 
step  toward  silicon  compilation  that  results  in  optimal  time  delay  circuits  because 
computers  can  think  in  terms  of  algorithms,  but  not  in  terms  of  rules  of  thumb  as 
many  engineers  do.  Human  engineers  can  design,  simulate,  and  redesign  until  they 
get  timing  right.  A  silicon  compiler  that  tries  to  achieve  desired  timing  parameters 
for  delay,  rise,  and  fall  time  has  to  be  taught  how  to  think  about  the  problem. 
That  is  perhaps  the  most  exciting  application  of  this  type  of  approach — compilers 
that  design  chips  to  meet  specified  timing  performance  standards.  For  future  re¬ 
search,  the  Monterey  Silicon  Compiler  code  that  was  modified  to  include  the  static 
CMOS  super  buffer  can  be  found  in  the  files  /a/work/steele/inacpitts/data-path.l 
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and  /a/work/steele/macpitts/data-path.o  on  the  Integrated  Solutions  CAD  work¬ 
stations  of  the  Electrical  and  Computer  Engineering  Department  Computer  Labo¬ 
ratories  at  the  Naval  Postgraduate  School.  The  NMOS  logic  cells  in  the  Monterey 
Silicon  Compiler  that  still  require  CMOS  replacements  axe  listed  in  Table  4.1.  This 
table  was  extracted  from  a  similar  list  compiled  by  E.  Malagon  [Malagon,  1985, 
P-95]. 


TABLE  4.1  MONTEREY  SILICON  COMPILER  NMOS  LOGIC 
CELLS  REQUIRING  CMOS  REPLACEMENTS 


_ ORGANELLE _ 

layout  -  =  organelle 

layout  -  equ  -  organelle 

layout  -  <>  organelle 

layout  -  <>  0  organelle 

layout  -  =  0  organelle 

lsh  -  zero /  lsh2/  lsh3/  lsh4/  lsh8 

rsh  -  zero/  rsh2/  rsh3/  rsh4/  rsh8 

layout  -  odd  -  operand 

layout  -  even  -  operand 

port  -  output  1 

bit 

layout  -  inverting  -  super  -  buffer 
15  contacts:  pc,  ndc,  pdc,  nsc,  psc 
entire  controller  section 


B.  RECOMMENDATIONS 


LIBRARY  FUNCTIONS 

word  —  equ 
<> 

<>  0 
=  0 


An  algorithm  for  transistor  sizing  in  static  CMOS  logic  design  has  been  de¬ 
rived  and  a  logic  device  whose  dimensions  were  determined  by  the  algorithm  has 
performed  successfully  in  SPICE  simulations.  It  is  recommended  that  future  re¬ 
search  in  this  area  investigate  the  possibility  of  incorporating  the  following  topics 
in  the  algorithm:  the  effects  of  a  non-ideal  input  square  wave,  empirical  adjustment 
of  the  basic  drain  current  equations,  the  effects  of  channel  length  modulation  and 
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of  the  body  effect  phenomenon,  accuracy  in  computation  and  in  fabrication,  and 
stage  ratios. 

1.  Effects  of  a  Non-Ideal  Input  Square  Wave 

The  derivation  of  the  equations  for  transistor  sizing  in  a  static  CMOS 
inverter  in  Chapter  II  assumes  that  the  input  voltage  waveform  is  a  perfect  square 
wave — one  that  rises  and  falls  instantaneously.  This  type  of  waveform  causes  one  of 
the  MOSFETs  in  the  inverter  to  be  in  cutoff  during  switching  thereby  permitting 
the  circuit  simplifications  represented  in  Figures  2.9b  and  2.10b.  A  more  realistic 
input  voltage  waveform  is  depicted  in  Figure  3.5.  Applying  the  definitions  of  the 
three  regions  of  operation  of  MOSFETs  presented  in  Table  2.1  to  the  input  and 
output  voltage  waveforms  of  Figure  3.5  shows  that  with  a  non-ideal  square  wave 
applied  at  the  input  of  an  inverter,  both  transistors  of  the  inverter  will  momentarily 
conduct  during  switching — one  sourcing  current,  the  other  sinking  current.  This 
obviously  affects  circuit  timing  because  the  rate  at  which  charge  is  delivered  to  or 
taken  away  from  the  load  is  decreased.  There  are  two  possible  solutions  to  this 
in  the  derivation  of  the  equations,  one  difficult,  the  other  easy.  The  more  difficult 
approach  is  to  rederive  the  equations  of  Chapter  II  based  on  a  time  varying  input 
voltage  waveform.  The  easier  approach  is  conduct  a  study  to  find  an  empirical 
adjustment  factor  that  would  modify  the  value  presented  to  the  algorithm  as  being 
the  desired  rise  time  to  compensate  for  the  effects  of  a  less  than  ideal  input  square 
wave. 


All  of  the  drain  current  equations  in  Table  2.1  can  be  multiplied  by  a  factor 
determined  empirically  to  make  the  drain  current  equations  more  perfectly  reflect 
physical  device  behavior  as  suggested  by  Hodges  and  Jackson  [Hodges  and  Jackson, 
1983,  pp.  51-52].  For  n  channel  MOSFETs  the  factor  is  (1  4-  LAMBDA,,  •  Vdsn). 


For  p  channel  MOSFETs  the  factor  is  (1+LAMBDAP  •  |V<f.sp|).  Using  these  factors 
makes  it  necessary  to  rederive  the  transistor  sizing  equations  of  Chapter  II  based 
on  these  new  drain  current  equations. 


3.  Channel  Length  Modulation 

It  is  known  that  devices  with  channel  length  shorter  than  10  microns  satu¬ 
rate  before  the  channel  pinches  off  because  the  electrons  (holes)  reach  the  scattering 
limited  velocity  in  the  channel  [Electronics  Research  Laboratory,  1980,  p.6].  Once 
in  the  saturation  region,  the  channel  becomes  shorter  than  Leff.  To  compensate, 
the  saturation  region  drain  current  equations  in  Table  2.1  can  be  divided  by  a 
channel  length  modulation  factor.  For  n  channel  saturated  MOSFETs  the  factor 
is  (1  —  LAMBDA,,  •  Vdsn)  [Electronics  Research  Laboratory,  1980,  p.  15).  For  p 
channel  saturated  MOSFETs  the  factor  is  (1  —  LAMBDAP  •  |Udsp|).  Making  the 
modifications  necessitates  rederiving  the  transistor  sizing  equations  of  Chapter  II 
based  on  the  modified  drain  current  equations. 

4.  Body  Effect 

In  the  extension  of  the  equations  derived  for  transistor  sizing  in  a  static 
CMOS  inverter  to  general  static  CMOS  logic  design  body-effect  is  neglected.  This 
could  perhaps  be  compensated  for  by  introducing  average  values  for  Vbs„  and  Vbsp 
in  the  same  manner  as  was  used  to  calculate  values  of  Cbdj^v^  {T}  and  CbdsA^%  {T} 
in  equations  (2.26)  and  (2.27). 

5.  Accuracy  in  Computation  and  in  Fabrication 

It  is  important  to  remember  the  first  axiom  of  computing  in  using  the 
transistor  sizing  algorithm  presented  in  this  thesis:  “garbage  in,  garbage  out.” 
If  for  example  the  SPICE  fabrication  parameters  given  to  the  algorithm  are  not 
measured  accurately,  the  algorithm  cannot  hope  to  produce  a  circuit  that  will 
behave  as  desired.  The  same  is  true  if,  for  evample.  during  fabrication  the  pattern 


definitions  created  by  the  lithography  process  are  inaccurate  or  the  diffusion  process 
is  poorly  controlled. 

6.  Stage  Ratio 

For  consistency  of  approach,  the  input  stage  dimensions  of  the  super  buffer 
were  designed  using  the  same  algorithm  used  to  calculate  the  output  stage  dimen¬ 
sions.  This  resulted  in  the  output  stage  being  1.266  times  larger  than  the  input 
stage.  This  value  is  known  as  the  stage  ratio  which  is  the  value  by  which  successive 
transistor  widths  are  multiplied  when  inverters  are  cascaded.  Mead  and  Conway 
have  written  that  the  value  of  the  stage  ratio  that  minimizes  overall  delay  for  a 
series  of  cascaded  inverters  is  given  by  Euler’s  number,  e  =  2.718- ••  [Mead  and 
Conway,  1908,  pp.  12-14].  Thus,  it  may  be  possible  to  achieve  slightly  reduced 
area  for  the  super  buffer  by  using  the  algorithm  to  design  the  output  stage,  then 
using  the  stage  ratio  2.718  to  fix  the  dimensions  of  the  input  stage.  However,  Mead 
and  Conway  as  well  as  Weste  and  Eshraghian  have  all  acknowledged  that  other  de¬ 
sign  criterion  may  steer  a  designer  away  from  a  value  of  2.718  for  the  stage  ratio 
[Mead  and  Conway,  1980,  p.  14  and  Weste  and  Eshraghian,  1895,  p.  197],  Since 
the  relative  time  penalty  for  using  the  stage  ratio  of  1.266  is  slight,  and  since  using 
this  ratio  allows  a  better  assessment  of  the  performance  of  the  algorithm,  the  value 
of  1.266  was  retained  and  the  resulting  circuit  met  its  performance  goals. 

7.  Suggested  Modifications  to  the  Monterey  Silicon  Compiler 

In  a  design  generated  by  the  Monterey  Silicon  Compiler  the  area  reserved 
for  the  super  buffer  on  the  floorplan  is  fixed.  Were  this  area  variable  the  compiler 
could  be  given  several  super  buffers  to  choose  from  and  it  would  select  the  one  that 
best  meets  the  drive  requirements  of  the  data  path.  This  would  involve  modifying 
the  body  of  code  that  defines  the  metal  skeleton  that  distributes  power,  ground, 
and  clock  signals  on  chip. 


The  data  registers  currently  in  use  are  very  tall  and  thin.  Since  the  wires 
that  connect  the  super  buffer  to  the  registers  traverse  the  longest  distance  across 
the  registers  the  routing  capacitance  has  become  unreasonably  large.  The  solution 
is  to  return  to  the  general  configuration  used  by  the  original  NMOS  data  registers. 
These  registers  were  short  but  wide,  greatly  decreasing  the  length  of  the  wires  that 
connect  the  super  buffers  to  the  data  register  and  thus  the  routing  capacitance. 


STATIC  CMOS  INVERTER  TRANSISTOR  SIZING  PROGRAMS 


This  appendix  contains  programs  that  implement  the  transistor  sizing  equa¬ 
tions  of  Chapter  II  for  a  static  CMOS  inverter.  The  programs  run  on  an  HP41CX 
with  a  magnetic  card  reader  and  two  extended  memory  modules.  Main  memory 
must  be  apportioned  so  that  SIZE  >  076. 

Executing  the  programs  in  certain  sequences  occasionally  results  in  the  mes¬ 
sage  “NO  ROOM”  and  a  halt  to  program  execution.  If  this  occurs,  the  following 
corrective  actions  should  be  taken  (in  the  order  listed): 

(1)  Clear  the  current  program  from  main  memory. 

(2)  Place  the  name  of  the  program  that  could  not  be  loaded  in  the  ALPHA  register. 

(3)  Execute  GETP. 

(4)  Execute  R/S. 

Program  execution  should  continue  normally  with  data  storage  registers  and 
user  flags  unaffected  by  the  temporary  halt  in  program  execution. 

The  interrelationships  of  the  programs  are  depicted  in  Figure  A.l.  To  run 
the  programs,  main  memory  must  first  be  entirely  cleared  and  all  programs  must 
be  placed  in  extended  memory.  The  name  “VLSI”  is  placed  in  the  ALPHA  reg¬ 
ister,  GETP  is  executed  followed  by  R/S.  The  program  VLSI  is  responsible  for 
loading  the  SPICE  fabrication  parameters.  These  parameters  must  be  stored  on 
magnetic  data  storage  cards  in  accordance  with  the  data  register  allocation  given 
in  Table  A.l.  The  cards  are  loaded  when  they  are  requested  by  the  program.  The 
program  VLSI  automatically  clears  itself  from  main  memory,  loads  the  program 


Figure  A.l  Interrelationships  of  Transistor  Sizing  Programs 
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TEMP  from  extended  memory,  and  continues  execution  with  the  first  line  of  code 
in  the  program  TEMP.  The  program  TEMP  calculates  the  effects  of  temperature 
on  the  fabrication  parameters  in  Table  A.l,  overwriting  the  data  registers  of  the 
temperature  dependent  parameters  with  their  temperature  altered  values.  Pro¬ 
gram  TEMP  then  clears  itself  from  main  memory,  loads  the  program  TOP-LVL 
from  extended  memory,  and  continues  execution  with  the  first  line  of  code  in  the 
program  TOP-LVL.  The  program  TOP-LVL  acts  as  a  memory  manager,  shifting 
the  programs  PWIDTH  and  LOADCAP  in  and  out  of  main  memory  as  required 
by  the  user.  The  program  PWIDTH  calculates  all  the  SPICE  MOSFET  model 
dimensions  and  drain  resistances  required  for  the  p  MOSFET  and  the  n  MOSFET 
in  a  static  CMOS  inverter  required  to  drive  a  given  load  capacitance.  If  the  load 
capacitance  is  not  known  but  the  dimensions  of  the  load  MOSFETs  are  known, 
the  load  capacitance  can  be  calculated  using  the  program  LOADCAP.  Table  A. 2 
details  the  remaining  data  register  allocation.  The  source  code  for  the  programs 
VLSI,  TEMP,  TOP-LVL,  PWIDTH,  and  LOADCAP  follows  Tables  A.l  and  A.2. 


TABLE  A.l  SPICE  FABRICATION  PARAMETER 
DATA  REGISTER  ALLOCATION 


SPICE 

PARAMETER 

DATA  REGISTER 
FOR  n  DEVICES 

DATA  REGISTER 
FOR  p  DEVICES 

KP 

Roo 

Rl7 

VTO 

Roi 

Rl8 

LD 

Ro2 

Rl9 

CJ 

Ro3 

R20 

CJSW 

Ro4 

R21 

FC 

R<)5 

R22 

MJ 

Ro6 

R23 

PB 

Ro7 

R24 

MJSW 

Ro8 

R25 

PHI 

Rod 

R26 

GAMMA 

RlO 

R27 

* Cmd 

Rll 

R28 

^rel, oxide 

Rl2 

R29 

CGSO 

Rl3 

R30 

CGDO 

Rl4 

R31 

TOX 

RlS 

R32 

UO 

Rl6 

R33 

RSH 

R-3 

R74 

i 

i 


Not  SPICE  parameters 


TABLE  A. 2  DATA  REGISTER  ALLOCATION  FOR  VARIABLES 


OTHER  THAN  SPICE  FABRICATION  PARAMETERS 


VARIABLE 

DATA  REGISTER 
FOR  n  DEVICES 

DATA  REGISTER 
FOR  p  DEVICES 

T  [Kelvin] 

R34 

R34 

EG  {T} 

R35 

R35 

EG  {Tnom} 

R36 

R36 

Temporary 

R37,  R38>  R39,  R40 

R41 ,  R421 R43  •  R75 

Vto  {T} 

R34 

R35  (abs.  value) 

A 

R36 

R36 

Vdd 

R38 

R38 

Channel  Mask  Length 

R66 

R42 

Cbdj^{T} 

R44 

R45 

C  bds^y  g  |l/| 

R46 

R47 

B 

R48 

R48 

tr 

R49 

R49 

£(=  Cr) 

R50 

R50 

Drain  Mask  Length 

R52 

R51 

No.  Drain  Diffusion  Contacts 

R56 

R53 

Diffusion  Contact  Mask  Length 

R57 

R54 

Diffusion  Contact  Mask  Width 

^58 

K55 

C 

R59 

R59 

D 

R«o 

R60 

F 

Rei 

R61 

G 

R«2 

R62 

H 

^63 

R63 

W 

R65 

R64 

AD  =  AS 

R?0 

R67 

PD  =  PS 

R71 

R68 

RD  =  RS 

R72 

1^69 

■ 
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PROGRAM  VLSI 


01  LBL  T  VLSI 
02  T  STOP.  PURGE  MAI 
03  T  (-  N  MEMORY 
04  PROMPT 
05  000.074 

06  T  ENTER  FAB.  PARA 
07  T  H  M.  CARD 
08  CF  09 
09  CF  10 

10  AVIEW 

11  RDTAX 

12  T  TEMP 

13  GETP 

14  END 
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PROGRAM  TEMP 


*  *  * 
21 
22 

23 

24 

25 

26 


LBL  tTEMP 
tOPERATING  TEMP  ( 
Th KELVIN  }  =  ? 
PROMPT 
STO  34 

CALCULATE  EG  {  T  )  *  *  * 
STO  37 
XEQTEQ{ T  } 

RCL  37 
STO  35 

CALCULATE  EG  (  Tnom  )  *  *  * 
300.15 
STO  37 
XEQTEG  (  T  ) 

RCL  37 
STO  36 

CALCULATE  PBn  {  T  )  *  *  * 
RCL  07 
STO  40 
STO  37 

XEQtPB  (  T ) 

RCL  37 
STO  07 

CALCULATE  PBp  (  T  )  *  *  * 
RCL  24 
STO  41 
STO  37 
XEQtPB  {  T ) 

RCL  37 
STO  24 

CALCULATE  PHIn  (  T  )  *  *  * 
RCL  09 
STO  37 

XEQtPB  (  T ) 

RCL  37 
STO  09 

CALCULATE  PHIp  (  T  )  *  *  * 
RCL  26 
STO  37 
XEQtPB  {  T ) 

RCL  37 
STO  26 

CALCULATE  CJn  (  T  )  *  *  * 
RCL  03 

STO  37  ;  C.Jn 

RCL  06 

STO  38  ;  MJ„ 

RCL  07 


wn  y-w  me  virwuvwivww^i  vn jlt  if  wvinmwnnivir.viw)-^;^  v  v.wvjvw 


STO  39  ;  PBn  (  T  ) 

RCL40 

STO  42  ;  PBn 

XEQtCJ { T  ) 

RCL  37 
STO  03 

*  CALCULATE  CJP  (  T  )  *  *  * 

RCL  20 

STO  37  ;  CJP 

RCL  23 

STO  38  ;  MJP 

RCL  24 

STO  39  ;  PBp  (  T  ) 

RCL  41 

STO  42  ;  PBp 

XEQtCJ  (  T  ) 

RCL  37 
STO  20 

*  CALCULATE  CJSW„  (  T  )  *  *  * 

RCL  04 

STO  37  ;  CJSWn 

RCL  08 

STO  38  ;  MJSW„ 

RCL  07 

STO  39  ;  PB„  (  T  } 

RCL  40 

STO  42  ;  PB„ 

XEQtCJ  {  T  ) 

RCL  37 
STO  04 

*  CALCULATE  CJSWP  (  T  )  *  *  * 

RCL  21 

STO  37  ;  CJSWp 

RCL  25 

STO  38  ;  MJSWp 

RCL  24 

STO  39  ;  PBp  {  T  } 

RCL  41 

STO  42  ;  PBp 

XEQtCJ  ( T  ) 

RCL  37 
STO  21 

*  CALCULATE  KPn  (  T  )  *  *  * 

RCL  00 
STO  37 

XEQtKP  {  T  ) 

RCL  37 
STO  00 

*  CALCULATE  KPp  (  T  )  *  *  * 

RCL  17 


CJSWp 

MJSWn 


87 

STO  37 

88 

XEQTKP  ( 

T) 

89 

RCL  37 

90 

STO  17 

*  *  * 

CALCULATE  UOn 

(  T  )  *  *  * 

91 

RCL  16 

92 

STO  37 

93 

XEQtKP  { 

T) 

94 

RCL  37 

95 

STO  16 

*  *  * 

CALCULATE  UOp 

{ T )*** 

96 

RCL  33 

97 

STO  37 

98 

XEQTKP  { 

T) 

99 

RCL  37 

100 

STO  33 

*  *  * 

CALCULATE  erei, 

oxiden  (  Y  )  *  : 

101 

RCL  00 

102 

STO  37 

;  KPn  {  T  ) 

103 

RCL  15 

104 

STO  38 

;  TOXn 

105 

RCL  16 

106 

STO  39 

;  UO„  (  T  ) 

XEQtEREL  (  T  } 

RCL  37 
STO  12 

CALCULATE  erci,  oxideP  {  T  )  *  *  * 
RCL  17 

STO  37  ;  KPp  (  T  ) 

RCL  32 

STO  38  ;  TOXp 

RCL  33 

STO  39  ;  UOp  (  T  ) 

XEQtEREL ( T  ) 

RCL  37 
STO  29 
TTOP-LVL 
GETP 

EG  (  T  )  SUBROUTINE  *  *  * 
LBLtEG  {  T ) 

RCL  37 
X  /  2 
.000702 
* 

RCL  37 
1108 
+ 

/ 

1.16 
X  {  )  Y 


O' 

2»* 

B.« 

k: 


B 


'VWV 
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STO  37 
RTN 

PB  {  T  ),  PHI  {  T  )  SUBROUTINE  *  *  * 
LBLTPB  (  T ) 

RCL  37 
RCL  34 


300.15 

/ 

STO  38 
3 

ENTER  y 
8.6173468  E-05 


;  Interim  Result 


;K/q 


RCL  34 


RCL  34 

300.15 

/ 

LN 


RCL  38 

X<)y 


RCL  35 


RCL  36 
RCL  34 


300.15 

/ 

+ 

STO  37 
RTN 

CJ  {  T  >,  CJSW  {  T  )  SUBROUTINE  *  *  * 
LBLrCJ ( T  ) 

RCL  39 
RCL  42 

/ 

X  (  }  Y 


RCL  34 

300.15 


.0004 


+ 

RCL  38 


mm 


*r  •  ^ 


TO 
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180  * 

181  1 

182  + 

183  RCL  37 

184  * 

185  STO  37 

186  RTN 

*  *  *  KP  (  T  ),  U0<  T  )  SUBROUTINE  *  *  * 

187  LBLtKP  {  T  ) 

188  RCL  34 

189  300.15 

190  / 

191  -1.5 

192  Y  /  X 

193  RCL  37 

194  * 

195  STO  37 

196  RTN 

*  *  *  freji  oxide  {  T  )  SUBROUTINE  *  *  * 


197 

LBLTEREL  {  T  ) 

198 

RCL  37 

199 

100 

200 

* 

201 

RCL  38 

202 

* 

203 

RCL  39 

/ 

8.854187818  E-14 

PROGRAM  TOP-LVL 


LBLtTOP-LVL 
FS?  09 
GTO  30 
CF  02 
CF  03 
ENG  3 

CALCULATE  Vton  (  T  )  *  *  * 
RCL  01 

STO  37  ;  VTO„ 

RCL  10 

STO  39  ;  GAMMA„ 

RCL  09 

STO  40  ;  PHIn  {  T  ) 

TVBS  {  N  ),  VOLTS  =  ? 
PROMPT 
ABS 

STO  41  ;  |  Vbsn  | 

XEQtVTO  (  T  ) 

RCL  37 
STO  34 

CALCULATE  Vtop  (  T  )  *  *  * 
RCL  18 
ABS 

STO  37  ;  |  VTOn| 

RCL  27 

STO  39  ;  GAMMAp 

RCL  26 

STO  40  ;  PHIp  {  T  ) 

TVBS  (  P  },  VOLTS  =  ? 

PROMPT 

STO  41  ;  Vbsp 

XEQTVTO  {  T  ) 

RCL  37 
STO  35 

INPUT  Vdd  *  *  * 

tSUPPLY  VOLTAGE  { 

T  I- VOLTS  )  =  ? 
PROMPT 
STO  38 

CALCULATE  RATIO  “A”;  *  *  * 
Wn  =  Wp  •  A  for  tr  ~  tf 


*  *  * 


RCL  38 

/ 

LN 

2 

/ 

STO  39 
RCL  34 
.1 

RCL  38 


;  Interim  Result 


RCL  38 
RCL  34 

7 

RCL  39 

+ 

RCL  38 
RCL  35 


RCL  17 
* 

STO  39  ;  Interim  Result 

19 

RCL  38 
* 

20 

RCL  35 


RCL  38 

/ 

LN 

2 

STO  40  ;  Interim  Result 

RCL  35 
.1 

RCL  38 


RCL  38 
RCL  35 


RCL  40 
+ 

RCL  38 
RCL  34 


102 


RCL  00 


STO  40  ;  Interim  Result 

TP  CHNL  MASK  LNG 
tKTH  {  MTR  )  =  ? 
PROMPT 
STO  42 

tN  CHNL  MASK  LNG 
tPTH  {  MTR  )  =  ? 
PROMPT 
STO  66 
2 

RCL  02 


RCL  39 


STO  39 
RCL  42 
2 

RCL  19 


;  Interim  Result 


RCL  40 


1/X 

RCL  39 


STO  36 
MENU  *  *  * 

LBL  01 
SF  27 

Twi  l: 

PROMPT 

CALL  TO  P WIDTH  *  *  * 
LBL  A 
CF  27 
tPWIDTH 
GETP 
GTO  01 

CALL  TO  LOADCAP  *  *  * 
LBL  E 
CF  03 
CF  27 
SF  09 

TLOADCAP 
GETSUB 
XEQTLOADCAP 
LBL  30 


K- 


i 


I 


42 

43 

44 

45 

46 

*  *  VTO 


CF  09 

tLOADCAP 
PCLPS 
CF  02 
GTO  01 

(  T  )  SUBROUTINE  *  *  * 
LBLtVTO  {  T  } 

RCL  40 
RCL  41 
+ 

SQRT 
RCL  40 
SQRT 


RCL  39 


RCL  37 

+ 

STO  37 

RTN 

END 
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PROGRAM  P WIDTH 


LBLtPWIDTH 
FS?  10 
GTO  40 
CF  02 

CALCULATE  Cbdjavg  „  (  T  )  *  *  * 
RCL  05 

STO  39  ;  FCn 

RCL  06 

STO  40  ;  MJn 

RCL  38  ;  Vdd 

.9 
* 

STO  41 
RCL  07 

STO  75  ;  PBn  {  T  ) 

XEQtCBDJ 
RCL  37 
STO  43 
RCL  38 
.1 
* 

STO  41 
XEQtCBDJ 
RCL  37 
RCL  43 
+ 

2 

/ 

STO  44 

CALCULATE  Cbdjavg  p  (  T  )  *  *  * 
RCL  22 

STO  39  ;  FCP 

RCL  23 

STO  40  ;  MJP 

RCL  38  ;  Vdd 

.9 
* 

STO  41 
RCL  24 

STO  75  ;  PBp{  T  } 

XEQtCBDJ 
RCL  37 
STO  43 
RCL  38 
.1 
* 

STO  41 
XEOTCBDJ 
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RCL  37 
RCL  43 
+ 

2 

!5T0  47 


*  *  *  CALCULATE  C^gate  *  *  * 


LBL  00 

tDO  YOU  KNOW  CE  L 
tI-OAD  DEV  ? 

AVIEW 
PSE 
SF  27 

tyes  _N0 

PROMPT 

LBL  E 

CF  27 

SF  10 

tLOADCAP 
GETSUB 
SF  03 

XEQtLOADCAP 
LBL  40 
CF  10 

tLOADCAP 
PCLPS 
GTO  00 
LBL  A 
CF  27 
FC?  02 
GTO  03 
LBL  04 

ENTER  MASK  GEOMETRY  DATA  *  *  * 
T DESIRED  RISE  TI 
ThME  (  SEC  )  =  ? 

PROMPT 
STO  49 

tROUTING  CAP  (  FAR 
THAD  )  = ? 

PROMPT 
STO  50 

T P  DRAIN  MASK  LN 
rhGTH  (  MTR  )  ? 

PROMPT 
STO  51 

tN  DRAIN  MASK  LN 
tHGTH  (  MTR  )  ? 

PROMPT 
STO  52 


4 


142 

143 

144 

145 

146 

147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

1 60 
161 
162 

163 

164 

165 
*  * 

166 

167 

168 

169 

170 

171 

172 

173 

174 

175 

176 

177 

178 

179 


tno.  p  diffusion 

Th  CONTACTS 
PROMPT 
STO  53 

tPDIFCTC  mask  ln 
tHGTH  {  MTR  )  ? 
PROMPT 
STO  54 

tPDIFCTC  MASK  WI 
tHDTH  {  MTR  )  ? 
PROMPT 
STO  55 

NO.  N  DIFFUSION 
Th  CONTACTS 
PROMPT 
STO  56 

tNDIFCTC  MASK  LN 
Tf-GTH  (  MTR  )  ? 
PROMPT 
STO  57 

tNDIFCTC  MASK  WI 
tI-DTH  {  MTR  ) 
PROMPT 
STO  58 

CALCULATE  “C”  *  *  * 

RCL  51 
RCL  19 
2 
* 

+ 

RCL  20 
* 

RCL  45 

RCL  21 
2 

RCL  47 


180 

181 

182 

183 


+ 

RCL  52 
RCL  02 
2 


184  * 


185 

186 

RCL  36 

187 

* 

188 

RCL  03 

189 

* 

190 

RCL  44 

191 

* 
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293 

RCL  02 

294 

2 

295 

* 

296 

+ 

297 

RCL  03 

298 

* 

299 

RCL  44 

300 

* 

301 

RCL  58 

302 

RCL  11 

303 

* 

304 

+ 

305 

RCL  57 

306 

* 

307 

RCL  56 

308 

* 

309 

ST+  60 

*  *  * 

CALCULATE  “F 

310 

RCL  42 

311 

2 

312 

RCL  19 

313 

* 

314 

— 

315 

2 

316 

* 

317 

318 

319 

320 

321 

322 

323 

324 

325 

*  *  * 

326 

327 

328 

329 

330 

331 

332 

333 

334 

335 

*  *  * 

336 

337 

338 

339 


RCL  49 

/ 

RCL  17 

/ 

RCL  38 
RCL  35 

1 

STO  61 

CALCULATE  “G” 
RCL  35 
RCL  38 
.1 


RCL  38 
RCL  35 

7 

STO  62 

CALCULATE  “H”  *  * 
RCL  38 
19 
* 

RCL  35 
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341 

* 

342 

— 

343 

RCL  38 

344 

/ 

345 

LN 

346 

2 

347 

/ 

348 

STO  63 

*  *  * 

CALCULATE  V\ 

349 

RCL  62 

350 

RCL  63 

351 

+ 

352 

RCL  59 

353 

* 

354 

RCL  61 

355 

* 

356 

1 

357 

X  (  }  Y 

358 

359 

1/X 

360 

RCL  61 

361 

* 

362 

RCL  62 

363 

RCL  63 

364 

+ 

365 

* 

366 

RCL  48 

367 

RCL  60 

368 

+ 

369 

RCL  50 

370 

+ 

371 

* 

372 

STO  64 

373 

RCL  36 

374 

* 

375 

STO  65 

*  *  * 

CALCULATE  0 

*  *  * 

PARAMETERS 

*  *  * 

RESULTS,  &  R] 

*  *  * 

TOP-LVL  *  *  * 

*  *  * 

CALCULATE  A 

376 

RCL  64 

377 

RCL  19 

378 

2 

379 

* 

380 

+ 

381 

RCL  51 

382 

RCL  19 

383 

2 

AND  Wn  *  *  * 


*  *  * 


*  * 


=  ASr 
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RCL  55 
RCL  19 
2 
* 

+ 

RCL  53 

RCL  54 
* 

+ 

STO  67 

CALCULATE  ADn 
RCL  65 
RCL  02 
2 
* 

+ 

RCL  52 
RCL  02 
2 


RCL  58 
RCL  02 
2 
* 

+ 

RCL  56 
* 

RCL  57 
* 

+ 

STO  70 

CALCULATE  PDF 
RCL  51 
RCL  19 
2 
* 

+ 

RCL  53 
RCL  54 


=  AS„  *  *  * 


RCL  64 
RCL  19 
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440 

441 

442 

443 

444 

445 

446 

447 

448 

449 

450 

451 

452 

453 

454 

455 

456 

457 

458 

459 


*  *  * 

460 

461 

462 

463 

464 

465 

466 

467 

468 

469 

470 

471 

472 

473 

474 

475 

476 

477 

478 

479 

480 

481 


+ 

STO  68 

CALCULATE  PDn 
RCL  52 
RCL  02 
2 


=  PSn  *  *  * 


+ 

RCL  56 
RCL  57 


RCL  65 
RCL  02 
2 


+ 

STO  71 

CALCULATE  RDf 
RCL  55 
RCL  19 
2 


+ 

1/X 
RCL  54 


RCL  53 

/ 

STO  37 
RCL  64 
RCL  19 
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+ 

1/X 
RCL  51 
RCL  19 
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497 

/ 

l 

498 

STO  37 

499 

RCL  65 

500 

RCL  02 

501 

2 

502 

* 

» 

503 

+ 

%  1 

504 

1/X 

V, 

S 

505 

RCL  52 

506 

RCL  02 

Vi 

507 

2 

508 

* 

> 

509 

+ 

A 

v 

510 

* 

$ 

511 

512 

RCL  37 

513 

RCL  73 

ft 

514 

* 

1 

515 

STO  72 

*  *  * 

OUTPUT  SPICE  MOSFET  PARAMETERS  *  *  * 

516 

517 

TL  jP>  = 

ARCL  42 

'X 

-V 

A 

518 

Tb<  MTR  ) 

519 

PROMPT 

Lj 

520 

TW  (  P  )  = 

.V 

521 

ARCL  64 

522 

Tb<  MTR) 

523 

PROMPT 

s 

524 

tAD  {  p  )  = 

a: 

525 

ARCL  67 

L 

526 

Tb<  SQ.  MTR  ) 

527 

PROMPT 

$ 

528 

tPD  (  P )  = 

529 

ARCL  68 

530 

Tb{  MTR  } 

531 

PROMPT 

•yfl 

• 

ft 

532 

tRD  (  P  )  = 

533 

ARCL  69 

S3 

534 

Tl-{  OHM  ) 

535 

PROMPT 

• 

536 

tL(N)= 

537 

ARCL  66 

538 

Th<  MTR  ) 

M 

539 

PROMPT 

ft 

540 

TW  {  N  )  = 

is 

541 

ARCL  65 

Ira 

542 

TH  MTR ) 

Kfl 

543 

PROMPT 

544 

tAD  (  N  )  = 

Sr 

545 

ARCL  70 

vB 

546 

Th(  SQ.  MTR  ) 

547 

PROMPT 

» 

548 

tPD  {  N }  = 

R 

549 

ARCL  71 

K8 

550 

Tl-{  MTR  ) 

yft 

551 

PROMPT 

w 

552 

TRD  {  N  )  = 

Jo 

553 

ARCL  72 

» 

554 

Tl-<  OHM  ) 

W 

555 

PROMPT 

yy 

556 

tTOP-LVL 

Xjj 

557 

GETP 

a3 
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Sm 

558 

LBLrCBDJ 

1 

*  -j 

559 

RCL  41 

560 

RCL  40 

ft 

561 

* 

562 

RCL  75 
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/ 

564 
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fc-s 
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1 

$ 
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yv 
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RCL  39 

Vp 
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* 

569 
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570 
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STO  37 
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LBL  03 

TC  E  LOAD  DEV  (  FARA 
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PROMPT 
STO  48 
GTO  04 
END 
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PROGRAM  LOADCAP 

lbltloadcap 

SF  02 

tLOADCAP 

AVIEW 

PSE 

0 

STO  37 
LBL  05 

tLD  CHNL  MASK  LN 
Tt-GTH  (  MTR  ) 

PROMPT 
STO  39 

TLD  CHNL  MASK  WI 
tDTH  (  MTR  } 

PROMPT 
STO  40 

tDEVICE  TYPE? 

AVIEW 
PSE 
SF  27 

TN  P 

PROMPT 

CALCULATE  WITH  N  DATA  *  *  * 

LBL  A 
CF  27 
RCL  39 
2 

RCL  02 


8.854187818  E-12 
* 

RCL  12 

RCL  15 

/ 

RCL  40 
★ 

RCL  13 
RCL  40 
* 

+ 

RCL  14 
RCL  40 
* 

+ 

ST+  37 

tANOTHER  DEVICE? 


ns 


it*  It*  f 
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AVIEW 
PSE 
SF  27 

T  Y  N 
PROMPT" 

LBL  B 
CF  27 
GTO  05 
LBL  D 
CF  27 
GTO  06 

*  *  *  CALCULATE  WITH  P  DATA  *  *  * 
LBL  E 
CF  27 
RCL  39 
2 

RCL  19 


8.854187818  E-12 

RCL  29 
* 

RCL  32 

/ 

RCL  40 

RCL  30 
RCL  40 
* 

+ 

RCL  31 
RCL  40 
* 

+ 

ST  4-  37 

tANOTHER  DEVICE? 

AVIEW 

PSE 

SF  27 

T_  _  Y  N 
PROMPT 
LBL  06 
RCL  37 
STO  48 
FC?  03 
GTO  07 
RTN 
LBL  07 

TC  E  LOAD  DEV= 
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STATIC  CMOS  SUPER  BUFFER  PSPICE  DECK 


This  appendix  contains  the  actual  PSpice  deck  used  for  the  simulation  of  the 
super  buffer  designed  in  Chapter  3.  The  worst-case  minimum  current  MOSIS 
SPICE  fabrication  parameters  are  included  in  the  deck. 


SUPER  BUFFER 

Ml  3  2  1  1  Ml  L=3.0Um  W=  106.1  Urn  AD=507.3P  PD=246.0Um 
M2  3  2  0  0  M2  L=3.0Um  W=41.01Um  AD=191.4P  PD=102.6Um 
M3  4  3  1  1  M3  L=3.0Um  W=134.3Um  AD=660.9P  PD=3l4.4Um 
M4  4  3  0  0  M4  L=3.0Um  W=51.93Um  AD=231.2P  PD=124.4Um 
Cl  4  0  511.2FF 
VDD  1  0  4.5 
VIN  2  0 

+PULSE  0  4.5  6Ns  O.OlNs  O.OlNs  6Ns  12Ns 
.TEMP  85.0 

.TRAN/OP  l.ONS  42NS  ONS  0.1NS 
.PROBE 

.PLOT  TRAN  V(2,0)  V(4,0) 

.PRINT  TRAN  V(2,0)  V(4,0) 

.OPTIONS  NODE  LIST  OPTS  ACCT  LIMPTS=2000  NUMDGT=8  ITL5=0  ITL4=40 

.WIDTH  OUT  =  80 

.OP 


*  MOSIS  CMOS  WORST  MINIMUM  CURRENT  CASE  SPICE  PARAMETERS 

* 


.MODEL  Ml 

+RD 

-f-LEVEL 

-f-VTO 

+KP 

+GAMMA 

+PH 

+LAMBDA 

+CGSO 

+CGDO 

+RSH 

+CJ 


PMOS 
=  32.70 

=  2.000 
=  -1.000 
=  1.26D-05 

=  0.700 

=  0.600 
=  4.70D-02 

=  4.00D-10 

=  4.00D-10 

=  70.000 

=  3.50D-04 
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+MJ 

= 

0.500 

+CJSW 

= 

2.00D-10 

+MJSW 

= 

0.330 

-f-TOX 

= 

5.50D-08 

+NSUB 

= 

1.12D+14 

+NSS 

= 

0.00D+00 

+NFS 

= 

8.80D+11 

+TPG 

= 

-1.000 

+XJ 

s 

4.00D-07 

+LD 

=s 

4.80D-07 

+UO 

= 

200.000 

+UCRIT 

= 

1.60D+04 

4-UEXP 

0.150 

+VMAX 

= 

1.00D+05 

+NEFF 

— 

0.010 

+DELTA 

* 

= 

1.900 

.MODEL  M2  NMOS 


+RD 

3S 

29.73 

+LEVEL 

= 

2.000 

-fVTO 

zzz 

1.000 

+KP 

3.77D-05 

+GAMMA 

= 

1.500 

+PHI 

ss 

0.600 

+LAMBDA 

rr 

1.60D-02 

+CGSO 

= 

5.20D-10 

+CGDO 

5.20D-10 

+RSH 

30.000 

+CJ 

2.20D-04 

+MJ 

=r 

0.500 

+CJSW 

3.00D-10 

+MJSW 

= 

0.330 

+TOX 

5.50D-08 

+NSUB 

= 

1.00D+16 

+NSS 

= 

0.00D+00 

+NFS 

= 

1.20D+12 

+TPG 

1.000 

+XJ 

= 

6.00D-07 

+LD 

3.20D-07 

+UO 

=: 

600.000 

4-UCRIT 

9.99D+05 

+UEXP 

= 

0.001 

+VMAX 

= 

1.00D+05 

+NEFF 

= 

0.010 

-f  DELTA 
* 

= 

1.200 

.MODEL  M3  PMOS 


+RD 

=  22.16 

+LEVEL 

=  2.000 

+VTO 

=  -1.000 

+KP 

=  1.26D-05 

+GAMMA  = 
+PHI 

+LAMBDA  = 

+CGSO 

4-CGDO 

+RSH 

+CJ 

+MJ 

+CJSW 

+MJSW 

+TOX 

+NSUB 

+NSS 

+NFS 

+TPG 

+XJ 

+LD 

+U0 

+UCRIT 

+UEXP 

-f-VMAX 

+NEFF 

+ DELTA 
* 

.MODEL  M4 

+RD 

+LEVEL 

+VTO 

+KP 

+GAMMA 

+PHI 

+LAMBDA 

+CGSO 

+CGDO 

+RSH 

+CJ 

+MJ 

+CJSW 

+MJSW 

+TOX 

+NSUB 

+NSS 

+NFS 

+TPG 

+XJ 

+LD 

+UO 

+UCRIT 

+UEXP 

+VMAX 

+NEFF 


=  0.700 

=  0.600 
=  4.70D-02 

=  4.00D-10 
=  4.00D-10 
=  70.000 

=  3.50D-04 

=  0.500 

=  2.00D-10 

=  0.330 

=  5.50D-08 

=  1.12D+14 
=  0.00D+00 
=  8.80D+11 
=  -1.000 
=  4.00D-07 

=  4.80D-07 

=  200.000 
=  1.60D+04 
=  0.150 

=  1.00D+05 
=  0.010 
=  1.900 

NMOS 
=  29.19 

=  2.000 
=  1.000 
=  3.77D-05 

=  1.500 

=  0.600 
=  1.60D-02 

=  5.20D-10 

=  5.20D-10 

=  30.000 

=  2.20D-04 

=  0.500 

=  3.00D-10 

=  0.330 

=  5.50D-08 

=  1.00D+16 
=  0.00D+00 
=  1.20D+12 
=  1.000 
=  6.00D-07 

=  3.20D-07 

=  600.000 
=  9.99D+05 
=  0.001 
=  1.00D+05 
=  0.010 
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